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ABSTRACT 


Methods of interpolation and quadrature have been used for over 300 years. Im- 
provements in the techniques have been made by many, most notably by Gauss, whose 
technique applied to polynomials is referred to as Gaussian Quadrature. Stieltjes 
extended Gauss’s method to certain non-polynomial functions as early as 1884. Con- 
ditions that guarantee the existence of quadrature formulas for certain collections of 
functions were studied by Tchebycheff, and his work was extended by others. Today, 
a class of functions which satisfies these conditions is called a Tchebycheff System. 
This thesis contains the definition of a Tchebycheff System, along with the theorems, 
proofs, and definitions necessary to guarantee the existence of quadrature formulas 
for such systems. 

Solutions of discretely observable linear control systems are of particular interest, 
and observability with respect to a given output function is defined. The output 
function is written as a linear combination of a collection of orthonormal functions. 
Orthonormal functions are defined, and their properties are discussed. 

The technique for evaluating the coefficients in the output function involves eval- 
uating the definite integral of functions which can be shown to form a Tchebycheff 
system. Therefore, quadrature formulas for these integrals exist, and in many cases 
are known. 

The technique given is useful in cases where the method of direct calculation is 
unstable. The condition number of a matrix is defined and shown to be an indication 
of the the degree to which perturbations in data affect the accuracy of the solution. 
In special cases, the number of data points required for direct calculation is the same 
as the number required by the method presented in this thesis. But the method is 
shown to require more data points in other cases. A lower bound for the number of 
data points required is given. 


in 



CHAPTER I 
INTRODUCTION 


In the study of discretely observable linear control systems, a system of first order 
linear differential equations x= Ax is given, where A is an n x n constant matrix, and 
x(t) = (xx(t),X2(t),* • • ,x n (t)) € R n for t in an interval /. An output function y is a 

linear combination of the coordinates of x; i. e., y(t) = c 1 x 1 (t) + c 2 x 2 (<)H f CnX n (<). 

A system is said to be observable if the initial value x(t 0 ) is uniquely determined by 
the output function y. The system is said to be discretely observable if the values of 
y(t) and some of its derivatives are known for a discrete set of values of t. (The data 
points required will be defined in Chapter V.) 

The output function y typically has the form 

y(t) = ( L1 ) 

1=1 

where for each i, r, is a polynomial to be determined, and A, is a constant. The 
polynomials in the solution can be calculated directly if an adequate number of data 
points are given. But two problems arise if the dimension of the system is large. First, 
a direct calculation would require the inversion of a matrix of high order, and this 
might not be practical. Second, even if the inversion of the matrix is carried out, the 
accuracy of the solution depends upon the accuracy of the data being used. For some 
matrices, slight perturbations in the data can lead to large errors in the solution. 

In this thesis, the solution y will be re-formulated using a new matrix known to be 
stable; that is, one which is minimally sensitive to perturbations in the data. Also, 
by using a collection of orthogonal functions, the constants in the solution can be 
expressed as integrals which can be calculated from the data points using quadrature 
methods, thus avoiding the necessity of actually inverting the matrix. 

Many methods of approximating the value of a definite integral have long been 
known. Integration formulas of interpolatory type are those found by approximating 
an integrand with a polynomial and then integrating the polynomial. The approxi- 
mating polynomial is one for which the values of the polynomial and the function are 
equal at a set of distinct points in the interval of integration. This method yields the 
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approximating formula 

/ f(x)dx S3 ^U>jfc/(Xjk)- 

• // Jt=0 

If the function / is a polynomial of degree less than or equal to n then the formula is 
exact. 

Functions other than polynomials have the property that their integrals can be 
expressed exactly in this way. Suppose / is a function and t\, • ■ ■ t n are points in 

an interval /.If 

f f(x)dx = £>*/(x*) 

JI k = o 

then Tlk=o w kf( x k) is said to be a quadrature formula. The points ti,t 2 ,- • • ,t n are 
called nodes, and the constants W\, u? 2 , - * • , w n are called weights. 

In general, there exist classes of functions for which a single quadrature formula 
is exact for each function in the class. Let u 1? U 2 , ■ ■ ■ , u n , • • • be functions which are 
integrable over an interval I. If there exist points • • ,t m in I and constants 

w u w 2 i • • • , such that 

[ m 

/ Ui{t)dt = 

JI Jt=0 

for each * = 1,2, • • ■ , n then the rule will be said to have degree of exactness n. If the 
functions u, are the polynomials 1, t, t 2 , t 3 , • • •, and *i, t 2 , • • • , <„ are distinct points in 
I , there exist weights Wi,W 2 , • • • , w n such that for all polynomials / of degree less 
than or equal to n — 1 the quadrature formula is exact. If the points tj, • * • , t n are 
chosen appropriately, the degree of exactness of the formula can be as great as 2n — 1 . 

In the latter part of the 17th century, Newton approximated a function / by 
constructing the unique polynomial of degree less than or equal to n — 1 which passed 
through a set of n distinct points of the function. The polynomial was expressed in 
terms of divided differences, but the form which we find convenient today is the one 
devised in 1795 by Lagrange. 

After learning of Newton’s ideas, Roger Cotes approximated the integral of a 
function with the integral of the interpolating polynomial which agreed with the 
function at equally spaced nodes. He computed the weights wi , W 2 , • • • , w n in the 
quadrature formula for all n < 11. The trapezoidal and Simpson rules are special 


cases. 
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Gauss raised the question of what the maximum degree of exactness would be 
if the nodes could be chosen arbitrarily. He used his theory of continued fractions 
associated with hypergeometric series to show in 1814 a way of choosing n nodes so 
that the quadrature formula is exact for all polynomials of degree less than or equal 
to 2n — 1. 

Jacobi showed in 1826 that if the nodes " ,t n can be chosen so that the 

node polynomial w n , given by 

U n {t) = {t-t x ){t-t *)•••(< -<n) 

is orthogonal to all polynomials of degree less than or equal to k — 1, then the quadra- 
ture rule has degree of exactness n — 1 + k. (A polynomial of degree —1 is taken to be 
identically zero.) Since u) n cannot be orthogonal to itself, k < n, and the maximum 
degree of exactness possible is 2n — 1. 

It was not until the latter half of the nineteenth century that the work of Gauss 
and Jacobi was extended to weighted integrals of the form 

[ b f(t)da(t)= f f(t)w(t)dt. 

J a J a 

The technique of Christoffel (1877) depended upon the generation of orthogonal poly- 
nomials, and he showed that they satisfy a three-term recurrence relation. 

In 1884, Stieltjes extended Gaussian quadrature to functions other than polyno- 
mials. In particular, he established formulas for the class of functions Uk(t) = t° k , 
where 0 < Qi < a 2 < a 3 < • • •, on the interval [0, 1]. 

The theory can be extended to a wide collection of classes of functions. In par- 
ticular, quadrature formulas theoretically exist for any class of functions which forms 
what is now called a Tchebycheff system. (Such a system will be defined in Chapter 
6.) Tchebycheff studied these systems in the latter half of the nineteenth century, 
and his work was extended by others, including M. G. Krein, S. Karlin, and L. S. 
Shapley. A comprehensive presentation of Tchebycheff systems is given in [28]. 

The output function (1.1) is the sum of functions of the form 

u : ,k{t) = t J exp(A fcf), for nonnegative integers,; and k. (1.2) 

This class of functions forms a Tchebycheff system, and the method of computing 
the polynomials r,(f) to be presented in this thesis will require quadrature formulas 
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for integrals of functions of the form (1.2). The definitions and properties of orthog- 
onal functions, Gaussian quadrature, stability of matrices, observability, Tchebycheff 
systems, and exponential interpolation will be discussed before discussing the appli- 
cation of these ideas to the determination of the output function (1.1). 



CHAPTER II 

ORTHOGONAL FUNCTIONS 


In many applications, a collection of orthogonal functions can be used to demon- 
strate existence of solutions and to facilitate computations. Because of their impor- 
tance in quadrature methods, a summary of some of their properties will be presented 
in this chapter. For a more detailed discussion of most of what follows, see [9]. 

Definition 2.1 Let X be an inner product space with inner product of f and g de- 
noted by (f,g). Two elements f,g £ X are said to be orthogonal if (f,g) = 0. A 
subset S of X is said to be an orthogonal set if for all f,g € S such that f g, f 
and g are orthogonal. 

Definition 2.2 Let S be an orthogonal subset of an inner product space X . If 
(/, /) = 1 for all f e S, thenS is said to be orthonormal. 

A closed bounded interval [a, 6] and a real-valued function w determine an inner 
product space defined as follows. 

Definition 2.3 Let X = C[a, 6]. Let w belong to X and w(x) > 0 on [a, 6], For each 
pair f,g € X, define 

(fid) = faf(t)g(t)w( t)dt. (2.1) 

Then ( f,g ) exists for all f,g £ X and is called the inner product of f and g with 
respect to the weight function w. 

The space X defined above is an inner product space, and a set {<j>k} of orthogonal 
polynomials with respect to a positive weight function w can always be found, where 
the degree of <f>k is equal to k. Inner product spaces also exist in case the interval of 
integration is infinite if the set X is restricted to integrable functions. 

One method of generating a set of orthonormal polynomials is the Gram-Schmidt 
process. This process can be applied to any sequence of functions Xi,x 2 , • • • , provided 
the set {xj, X 2 , • • • , is linearly independent for each k. Assume Xo(f) = 1 and 
xjt(f) = t k for each positive integer k. Define 

= {xlilyn 
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y t {t) = x t (t) - (®i,^o)^o(0 
= (yii/i) )l/2 


y*(0 = Xk(t) — y~!(jfc, 4>k-i)4>k-i 

k=0 

^ = (yUm ) 1 ' 2 


The set {<£ 0 , <^i> • • • , * • •} is orthonormal. For each k, the degree of <f>k is equal to 

k , and the leading coefficient of <f>k is positive. The following conditions also hold. 

1. For each <f>k{t), there exist constants ao, fli, ■ • • , a/t such that 

= a oXo{t) + QiXi(i) + • • • + cikXk(t). 

2. For each x*, there exist constants b 0 , • • • , such that 

2k{t) = bQ<t>o(t) + b\<f>\{t) + ■ • ■ + bk<f>k{t)- 

This implies that <f > jt is orthogonal to every polynomial of degree less than k. Suppose 
{<i>k} and {xjJ axe collections of polynomials for which condition (1) above holds. If 
{<^*} is an orthonormal set of polynomials with positive leading coefficients, then it 
is precisely the set of polynomials which is generated from the collection {x^} by the 
Gram-Schmidt process. 

Real orthonormal polynomials satisfy a recursion relation, and the following theo- 
rem provides a method of computing the polynomials if the constants in the recursion 
relation can be calculated. 

Theorem 2.1 Real orthonormal polynomials satisfy the following three term recur- 
sion relation. 

4>n(t) = (a n t + j3 n )<f> n -i(t ) - 7 n<t>n-2{t) 

The following theorem gives a method of constructing an orthogonal set of monic 
polynomials. In this method, constants Co,Cj,c 2 , •• • can be defined by 

°0 = 

c n = f h J n w{t)dt, 71 = 1,2,---, 
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but the theorem is valid for more general sequences {c„}, provided 


Co 

Cl 

Cl ••• 

c 2 ••• 

Cn— 1 1 

Cn 

^0 for n = 1,2, • • • 

(2.2) 

c„-l 

Cn * * * 

C2n-2 ) 




In general, let c* be the linear functional on the space of real polynomials defined by 

c m (t n ) = n = 0, 1,2, • • • . 

An inner product on this space is defined by 

<P 5 ?) = c m (p(t)q(t)) (2.3) 


Theorem 2.2 There exists a unique sequence of monic polynomials which form an 
orthogonal set with respect to the inner product [2.3]. The polynomials are given by 
the following, where s n is defined for n = 1, 2, ■ • • by [2.2]: 

^ Co Cl • • ■ Cn 

J C 1 c 2 C n+ 1 

<tn = — det 
•S„ 

Cn— 1 C n • • • C2n—\ 

1 t ••• t n 

The sequence of polynomials {g n } determined in the above theorem satisfies the three 
term recurrence relation 



9-1 (0 = 0, q 0 {t) = 1, 

9n+i(t) = (t — a n+ i)q n (t) — 0* q n -x(t) (2.4) 

n = 0, 1,2,- • • 

An algorithm for computing the constants a n and 0 n is given in [19]. These polyno- 
mials are called Lanczos polynomials of the first kind for {c„}. Lanczos polynomials 
of the second kind for {cn}, denoted by {p n }, are generated if the initial conditions 
in [2.4] are altered as follows: 

P- 1 = -1, Po(t) = 0, 

Pn+i(t) = (t - a n+l )p n {t) - 0 2 nPn -i(t), (2.5) 

n = 0,1,2,--- 
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The following classical examples of real orthogonal polynomials are generated by 
the inner product [2.1], where [a, 6] = [—1,1]. 

1. The weight function w(t) = 1 yields the Legendre polynomials. 

2. The weight function (1 — t 2 ) -1 / 2 produces Tschebysheff polynomials of the first 
kind. 

3. The weight function (1 — t 2 ) 1 ^ 2 produces Tschebysheff polynomials of the second 
kind. 

4. Jacobi polynomials are produced by the weight function (1 — £)°(1 + t) 13 , where 
q and 0 are constants greater than —1. 

For some weight functions, the interval may be chosen to be infinite. For example, 

1. generalized Laguerre polynomials result when the interval is [0,oc) and the 
weight function is t Q e~\ where a is a constant greater than —1, and 

2. Hermite polynomials occur for an interval (— oo, oo) and a weight function e - ' 2 . 

An important property of the zeros of real orthogonal polynomials with respect 
to the inner product [2.1] is given in the following theorem. 

Theorem 2.3 The zeros of real orthogonal polynomials are real, distinct, and are 
located in the open interval (a, 6). 

The following definition of the kernel polynomial of an orthonormal system and the 
Christoffel-Darboux theorem lead to an interlacing property of the zeros of consecutive 
polynomials. 

Definition 2.4 Let {<t> n } be a system of real orthonormal polynomials. The function 

k = 0 

is called the kernel polynomial of order n of the orthonormal system. 
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Theorem 2.4 (Christoffel-Darboux) Let {<p„} be a set of real orthonormal poly- 
nomials, where 

<t>n{t) = a nn t n + a n (n-i)* n 1 + • • • + a»»o 


for n = 0, 1,2, •••. Then 


K (t 3 ) = f °nn \ f 0n(O^n-H (■ 3 ) 

’ ] V a (n+l)(n+l)/ V is 

andf^iMi)} 2 = ) (#»+ i(0A»(0 - ^(O^t-i(O) 

fc=o \ u (n-fl)(n+l)/ 


Using the Christoffel-Darboux theorem, the following can be proved. 


Theorem 2.5 Let be a collection of orthogonal polynomials where , for n = 

0,1,2, •••, the degree of <f> n is equal to n. Let k be a positive integer. By theorem 
[2.3], <f>k + 1 has k + 1 real distinct zeros. If A x and A 2 are consecutive zeros of 4>k+i, 
then <f> k has a zero between A x and A 2 . 



CHAPTER III 
GAUSSIAN QUADRATURE 


The problem of approximating a definite integral has been studied extensively, and 
many methods have been developed. The method of quadrature takes advantage of 
the fact that the polynomials are dense in the set of all functions which are continuous 
on a given interval. If the value of a function / is known at points < 1 , t 2 , • • • , t n in 
the interval of integration, a polynomial of degree n — 1 exists which agrees with 
the function / at each of these points. Such a polynomial is called an interpolating 
polynomial for /, and is said to interpolate / at the points t x , t 2 , • • • , t n . The integral of 
this interpolating polynomial is then an approximation of the integral of the function 
/• 

The computation of the integral of an interpolating polynomial for the function f 
at points t\, < 2 , • • • ,t n requires the determination of constants Ax, A 2 , • • • , A n as well as 
the values f(t\),f(t 2 ), • • • f{t n ). The integral is then given by the quadrature formula 
5Z)t=i A kf{tk)- If two points ti and t 2 are used, the interpolating polynomial is linear, 
and the quadrature formula is a special case of the trapezoidal rule. If three equally 
spaced points are used, the quadrature formula is a special case of Simpson’s rule. 

An advantage of the quadrature method is that the quadrature formula is exact 
for all functions which are polynomials of degree less than or equal to n — 1. If 
some of the points t \ , t 2 , •• • , t n are not predetermined, they can be chosen so that the 
quadrature formula will be exact for polynomials of higher degree. 

In general, suppose ii 1 ,u 2 ,---u n are continuous real- valued functions defined on 
a closed finite interval [a, b] containing the points t u t 2 , • • • , t m . Let A 1 ,A 2 ,---,A m be 
constants such that for each k = 1, 2, • • • , n, 

fa u k(t)dt = jr Xjt 

j=i 

If the function / is a linear combination of the functions Mi, u 2 , • • • ,n„, then 

j=i 

The constants A x , A 2 , • ■ • , A m are called weights , and the points £i, t 2 , ■ • • , t m are called 
nodes. Necessary and sufficient conditions for the existence of such nodes and weights 
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for a given set of functions, along with the number of nodes and weights required, are 
given in [ 28 ] and will be discussed elsewhere in this paper. 

Consider the particular case in which the functions to be integrated are poly- 
nomials. If the nodes are pre-determined, then n of them will be required for the 
quadrature formula to be exact for all polynomials of degree less than or equal to 
n — 1. If, however, all the nodes may be selected appropriately, then n nodes will give 
a quadrature formula exact for all polynomials of degree less than or equal to 2n — 1 . 

Let n be a positive integer and the set of all polynomials of degree less than 
or equal to n — 1. Given n distinct points t\, <2, ■ • • t n in the interval [a, 6] and n values 
3/1, • ■ • , y n , there exists a unique polynomial p € !P n _i for which 


P(tk) = yk k = 1 , 2 , • • • ,n. 


The polynomial p can be represented by the Lagrange formula as follows: 

p(0 = EjA(<), 

k=l 


where for each k , t k is the Lagrange polynomial defined by 


m 


(t - - h) • • • {t _ - u+i) ■ • • {t ~ t n ) 

( t k ~ h )(t k ~ t 2 ) • • • ( t k ~ t k -l){tk - t k+ 1) • ••(**- t n ) ' 


( 3 . 1 ) 


Then n 

f*p(t)dt = X kVk 
k = 1 

where for each k = 1 , 2 , • • • , n, 

A* = S h Jk{t)dt. 

Since the scalar values depend only upon the nodes • • • , then for each 

polynomial in 1, the integral can be evaluated if the value of the polynomial is 
known at each of these nodes. 

Suppose the nodes • ,t n in the above method are not pre-determined. It 

was found by Gauss that they can be selected so that the quadrature formula is exact 
for all polynomials in fyn-i- In order to determine such nodes, a set of orthogonal 
polynomials is defined, and the nodes are roots of the polynomial of degree n. 

By the theory of orthogonal polynomials, if {<p 0 > 4>2, • • • , <?„} is a set of real orthog- 
onal polynomials over the interval [a, 6] such that for each k = 0, 1 , • • • , n, the degree 



12 


of fa is equal to k, then the roots of each fa are real, distinct, and lie in the interval 
(a, b). Further, if p is any polynomial of degree less than k , then <pk is orthogonal to 
p. The following theorem validates the method of Gaussian quadrature. 

Theorem 3.1 (Gauss-Jacobi) Let w be a positive weight function and {fa, fa, • • • , fa} 
a set of orthogonal polynomials with respect to the inner product (2.3) such that for 
each k = 0,1, •••,«, the degree of <f>k is equal to k. Let , < 2 , - • • ,t n be the zeros of 
<f> n , where a < t x < < 2 < ••• < t n < b. There exist positive constants Ai, A 2 , • • • , A n 
such that fj)p(t)w(t)dt = Ylk=i A kp(tk) whenever p G ^n-i- 


Proof: Let p belong to iP 2n _i, and define q G l*n-i by 

?(*) = ]£/>(** )4b(0 

Jt=i 

where ik{t) is the Lagrange polynomial (3.1). Since ik{t) satisfies 


£k(tj) = Skj = 


0 if k ^ j 

1 if k = j 


then q(tk) = p(h) for each k = 1,2, •••,«. Therefore p(t) — q(t) belongs to ? 2 n-i 
and has zeros at t\, < 2 , • • • , t n . Since these are precisely the zeros of the nth degree 
polynomial <f) n , we have 

p{t) - q(t) = <M f ) r „-i(0 

where r n _! G Since the degree of r n _ x is less than the degree of fa, then 4> n is 

orthogonal to r n _j, and therefore 


f*p{t)iv{t)dt 


/a [?(0 + 

f«q{t)w(t)dt 


/ 

Ja 


E pMhw 


Uc=l 


w(t)dt 




Define 


A. = 


(3.2) 
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Then 


/oP(*MO<ft = H *kp(tk)- ( 3 - 3 ) 

k=l 

To see that each is positive, note that (^(f)) 2 € r 2 n- 2 i and therefore 


J=1 


Also, (^e(<)) 2 has zeros at t\,t^ • • ■ f*+u • • • and (^jt(f*)) 2 = 1. So 

= EAitfkfc)) 2 

J=1 

= sl(lk(t))Mt)dt > o. 

Although (3.2) defines the weights in the Gauss formula (3.3), other methods exist 
for their determination. For references, see Gautschi’s survey [13]. 



CHAPTER IV 
STABILITY 


Let A be a nonsingular matrix in c nxn and 6 an element of C n . Then the equation 
Ax — b has a unique solution for x, where x belongs to C n . In general, the accuracy 
of the solution x is affected by the accuracy of A and b. The problem is considered 
to be stable , or well-conditioned if only small perturbations in the solution x result 
from small perturbations in A and b. 

In order to quantify the magnitude of a perturbation, it is convenient to define a 
vector norm || ||. If x is the solution of Ax = 6, and x + y is an approximate solution, 
a convenient measure of the magnitude of the error is the relative perturbation of x, 
defined by ||y||/||x||. 

Suppose the matrix A and the vector b are perturbed by F and k respectively. If 
x + y is the solution of the perturbed system, then 

(A + F)(x + y) = 6 + k. 

An upper bound for the relative error depends not only on the magnitude of the 
perturbation of A and 6, but also on the matrix A. The following theorem will be 
helpful in determining an upper bound. Details of the proof of this theorem and the 
following results are given in [31]. 

Theorem 4.1 If || || denotes any matrix norm for which ||/|| = 1, and z/||A/|| < 1, 
then (I + M)~ l exists and 

The following definition of the condition number of a matrix will be useful in 
expressing an upper bound for the relative error. 

Definition 4.1 Let A be a nonsingular matrix. The condition number of A with 
respect to || ||, denoted C(A), is defined by 

C(A) = ||A||||A- 1 || 

An upper bound for the relative error is given by the following theorem. 
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Theorem 4.2 Let A and F belong to C nXn , and let x,y,b, and K belong to C n . 
Assume that {A + F)(x + y) = b+ k. Assume also that ||A~ 1 ||||.F|| < 1 ll-^ll = 1* 
Choose a vector norm which is compatible with the matrix norm. (That is, ||Ar|| < 
||x|| for all x in C n and all A in C nXn .) Then the relative error satisfies 


C(A) 


1-(C(A)OT/|| J 4||) 


V 11611 


+ !!£!f\ 

\\A\\ 


As expected, the relative error in a solution x of Ax = b will be small if k and F 
are “small” relative to b and A. But the condition number of A also determines the 
upper bound. If C(A) is large, then it is possible that small perturbations in A and 
b could result in a rather large relative error in the solution x. Since 


C(A) = \\A\\\\A- l \\>\\A.A-'\\ = \\I\\ = l, 


then a well-conditioned system would have C(A) close to 1. 

If there is no perturbation of the matrix A , the above upper bound for the relative 
perturbation of x reduces to 

M/Ml < C(A) (Mj 

In case the only perturbation is in the matrix A , then 



x| 


< 



Let A = (flij) belong to c nxn and define the matrix norm j| ||i by 


Mi = max 1 < i < f ,£|a 0 -|. 

t=I 

Consider annxn complex matrix generated by the functions e Alt , e A2 ‘, • • • , e An *, where 
A 2 , • • • , A n are complex constants such that Re A x < Re A 2 < • • • < Re A n . For real 
numbers tj, ti, ■ • • ,t n satisfying 0 < t x < t 2 < • • • < t n < oo, define the matrix 


E n = 


/ e Al(l e A2<1 


o^n^2 


V e 


1 tn ^ n ... p^rx^n 
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It is shown in [ 32 ] that if E n is nonsingular, then the condition number of E n is greater 
than or equal to n. If Aj, • • • , A n are real and distinct, E n will be nonsingular for 
any choice of (^,<2, • • • ,<„) provided 0 < t x < <2 < • • • < t n < 00. Therefore, the 
existence of a choice for (ti, <2, • • • ,t n ) which produces a matrix E n with minimum 
condition number is assured. The paper also produces upper and lower bounds for 
C(E n ) for some special cases. 



CHAPTER V 
OBSERVABILITY 


Let A be a constant matrix in C nXn , c a constant vector in R n , and / an interval. 
Let x be a function from / to R". Consider the linear system 


x = Ax y = c T x x(t 0 ) = 


ai 

a 2 

\a n J 


to € I 


(5.1) 


The function y is called the output function. In most of what follows, it will be 
assumed that t 0 = 0, since an appropriate translation of the variable transforms a 
system into this form. It will also be assumed that / has the form [0, oo) or [0,6] for 
a real number 6 > 0. 

Let X\ , x 2 € R n be values for x(i 0 ) and the output functions yi,y 2 solutions of 
(5.1) corresponding to the initial values Xj and x 2 respectively. The system is said 
to be observable when y\ = y 2 if and only if x\ = x 2 . An observable system is said 
to be discretely observable with respect to points ,t 2 ,---,t m in the interval I and 
nonnegative integers $i, s 2 , • • • ,s m satisfying YlT=i s k = n if whenever y is a solution 
of (5.1) satisfying 


y( f i) = Vih) = •• ■ = y(t m ) = 0 and if m < n 
yVk) = y"(t k ) = - ■■ = yl’*-'\t k ) = 0 (5.2) 

for each k where s k > 1 


then y(t) = 0 on I. 

If the system (5.1) is observable, then the constant vector c has no zero compo- 
nents. The solution for x is given by 

x(t) = exp(At)x(0). 

The n x n matrix exp(At) has the form exp(Af) = ( q x: (t)), where each qij(t) is given 


17 
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by 

ffu(o = 51 Pii*(*)c Afct 

*=1 

where {A 1? A 2 , • • • , A,} is the set of distinct eigenvalues of A, and each pijk is a poly- 
nomial of degree less than n. The output function y is given by 

y(t) = c T x(t) = c r exp(/tt)x(0) = £ b ij * ?ii(0 

»\j=i 

where each 6^ is a constant. Regrouping the terms gives y(t) = r i{^) e ' K,t i where 
each r* is a polynomial of degree less than n. 

The eigenvalues of the matrix A determine the form of the solution y, and it will 
be assumed that A is in Jordan canonical form. This assumption leads to no loss of 
generality since every matrix is similar to a matrix J in Jordan canonical form, and 
the linearity of the dynamical system is preserved under a change of variables of the 
form Z = Px, where A — P~ 1 JP. 

Suppose a set {t;}” C I satisfies (5.2). Assume the eigenvalues of the matrix A 
are distinct and that 



In this case, the output function y is given by 

y(t) = 

i=l 

for constants &i, 6 2 , • • • , b n . Then y(t) = 0 if and only if 6, = 0 for each i = 1,2, • • • , n. 
This will hold if an only if the system 
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has only the zero solution for the vector (b\, b?, • • • yb n ) T . Therefore the system will 
be discretely observable with respect to fj, t 2 , • • * ,t n if and °nly if the matrix 



is nonsingular. 

Suppose the eigenvalues Aj, A 2 , • • • , A n of A are all real. It is shown in [28] that 
the determinant of E is positive provided \i < A 2 < • • • < A n and t\ < < 2 < • • • t n . 
So, in this case, the system would be observable for any choice of n distinct points. 

If some of the eigenvalues of A are not real, it is shown in [39] that the determinant 
of E is nonzero if <1 is chosen so that t\ ^ 2rmr/(\ k — A ; ) for all integers m and all 
k ^ j, where 1 < k,j < n, and for t k = kti, where i = 2, ■ • ■ ,n. The thesis also 
provides some examples of discretely observable systems in case A belongs to C 2x2 or 
C 3 * 3 . 

The question of discrete observability is related to an interpolation problem defined 
as follows. 


Definition 5.1 Let V be an n- dimensional linear space. Let L\, X 2 , • • • » L n be given 
linear functionals defined on V, and let W\,W 2 , • • • ,w n be given constants. The set of 
ordered pairs {(Lk,Wk))l = idefines an interpolation problem. If x € V. and Lk{x) = 
Wk for each k = 1,2, • • • , n, then x is said to be a solution of the interpolation problem. 

Let V = {y | y is an output function of the system (5.1)}. Then V is of dimension 
n if and only if the system is observable. For a proof of this and the following theorem, 
see [39]. 


Theorem 5.1 Let {tj , t 2 , • • • , t n } be a set of discrete points in I, and define lin- 
ear functionals Li, L 2 , - ■ ■ , L n by L k (y) = y(t k )- Then the interpolation problem 


{(Lit, 0)}£ =1 has a unique solution if and only if the system (5.1) is discretely ob- 


servable with respect to 


Consider a system with a nilpotent matrix >1, where A n = 0. In this case, the 
output function y can be shown to be a polynomial of degree less than or equal to 
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7i — l. The interpolation problem has a unique solution for any choice of points 
ti,t 2 ,- (See [8].) Therefore, the system is discretely observable with respect to 

these points. 

Another condition which is equivalent to discrete observability is given by the 
following theorem in [8]. 

Theorem 5.2 Let V be an n-dimensional linear space and V" its dual space. Let 
L u L 2, — ,L» be elements ofV m . The interpolation problem defined by {(Lk. ^k)}/t=i 
possesses a solution for arbitrary values u>i, w 2 , • • • , w n if and only if the function- 
als L\, Li, • • • , L n are linearly independent in V*. The solution to the interpolation 
problem is unique. 

Once it has been established that a system is discretely observable with respect 
to a set of points ti,t 2 ^ • • ,t n , determination of the output function y can be made 
in a number of ways depending upon the characteristics of the matrix A. The choice 
of a method is influenced by the size of A and by its eigenvalues. For example, if the 
matrix A belongs to IR 2 * 2 and the eigenvalues \ x and A 2 are real and distinct, then y 
is given by 

y(t) = b 1 e x ' t + b 2 e x ' t . 

Given any two nonnegative values t x and t 2 , with y{t\) = u>i and y{t 2 ) = w 2 , the 
constants bi and satisfy 



In this case, the inverse of the matrix 

/ e *,t. e V, \ 

^ e Xih e hh J 

can be easily computed and the coefficients bi and 62 determined with accuracy. But 
if the matrix E is large, computing the inverse is not practical. Also, a s discussed in 
Chapter IV, it might be poorly conditioned. In that case the accuracy of the solution 
might be poor regardless of the method used to solve the above equation. 

It is shown in [8] that if W = C[a,b] and t x ,t 2 ,---,t n are distinct points in [a, b], 
then the linear functionals L\, L 2 , • ■ ■ , L n defined by Lk(f) = f(tk) a re independent 
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in C[a,b]. It is also shown that if X is an n -dimensional linear space, then the dual 
space X’ is also n-dimensional. These two theorems imply the following. 

Theorem 5.3 Let V = {y | y is an output function of the system (5.1)}, and let 
• • ,t n be discrete points in [a, 6]. Let L be any linear functional in V". Define 
Li,L 2 ,---,L n in V ‘ by L k (y) = y(f*). Then there exist constants "A such 

that 


L{y) = biLi(y) + b 2 L 2 (y) + h b n L n (y) 

= biy(ti) + + * • • + b n y(t n ) 

The above theorem suggests the applicability of quadrature methods when the linear 
functional L is defined by 

L{y) = S k .y(t)dt. 


This will be discussed further in Chapter VII. 



CHAPTER VI 
TCHEBYCHEFF SYSTEMS 


In the method of Gaussian quadrature, orthogonal polynomials are used to deter- 
mine nodes, • • * , t n , and weights, A 1? A 2 , • • • , A n , such that 

= El UhpihMh) 

for all polynomials p of degree less than or equal to 2n — 1. Theoretically, nodes and 
weights exist which make 

l b J(t)w(t)it = ELi hf(t k )w(h) 

for other collections of functions provided they satisfy certain conditions. In this 
chapter an introduction to TchebychefF systems will be presented, and it will be 
shown that quadrature formulas are possible for functions which form such a system. 


Definition 6.1 Let u 0 ,«i, • • • ,u n denote continuous real-valued functions defined on 
a closed finite interval [a, b]. The collection of these functions will be called a Tcheby- 
chefF system over [a, 6], abbreviated T-system, provided the determinants 


( 0, 1, •••, n 

y ^Ot ll: i In 


U 0 (*o) «o(*l) U 0 (<n) 

Ul(fo) Ui(fi) U 1 (In) 

Un(to) «n(M Un(<n) 


( 6 . 1 ) 


are strictly positive whenever a < t 0 < t\ < • • • < t n < b. The collection of functions 
u 0 , u h • ’ * > u n will be referred to as a complete TchebychefF system, abbreviated CT- 
system, if {u 0 , Uj, • • • , u r } is a T-system for each r = 0, 1, • • • , n. 


An example of a CT-system is the collection {tx,}[} of functions defined by u 0 (£) = 
1, and u,(t) = V for i a positive integer. For any choice t 0 , t x , • • • ,t n , where t 0 < ti < 
• • • < t n , the determinant 

I 1 ••• 1 

foi ^1 ' In 

4 n a n a n 

Cq Cl ' - - l n 
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is the Vandermonde determinant and has the value 

n (t,-u)>o. 

Q<i<j<n 

The following theorem guarantees that the set {exp(a,f)}£ is a T-system on any 
closed interval if each a, is real and a 0 < Qi < • • • < a n . 

Theorem 6.1 Let {x,}£ and {y,}£ be sets of real numbers where x 0 < Xi < • • ■ < x n 
and yo < yi < - • • < y n - Let 

exp (x 0 y 0 ) exp(x 0 yi) ••• exp(x 0 y n ) 
exp(xiyo) exp(xxyi) ••• exp(xiy n ) 

exp(x n y 0 ) exp(x n yj) ••• exp(x n y n ) 

The determinant of E n is positive. 

Proof: Let u n be any function of the form 

Un{y) = Yl aieXy ( 6 - 3 ) 

.=0 

where x, 6 E and a, € E for each i = 0, 1, • • • , n, x 0 < Xj < • • • < x n , and <*, 2 > 
0. 

It will be shown by induction that u n (y) has at most n distinct real zeros. First, 
let n — 0. For any real numbers a 0 ^ 0 and xo, the function uo defined by u 0 = aoe XQV 
has no zero. Suppose any function u k of the form (6.3) has at most k distinct real 
zeros. Let x 0 , Xi, * • • , xt +1 and Oq, aj, • • • , a^+i be real numbers with x k +\ > x k and 
a* + i ^ 0. Define 

«*«(») = £ 

k = 0 

Suppose has k + 2 zeros. Then Uk+\{y) exp(— Xk+iy) has k + 2 zeros, and by 
Rolle’s theorem, -j- [uk+i{y) exp(-ik+iy)] must have k + 1 zeros. But 

d_ 
dy 


k 

ak + i + ^a, exp[(x t - xjt +1 )y] 

t=0 

X>(x,- ) ®xp[(xj Xjt +1 )y], 

i=0 



Uk+i{y)exp(-x k+l y) = — £ a, exp[(x, - x k+1 )y] 

y t=o 
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which has at most k distinct real zeros by assumption. 

Next, induction can be used to show that det(£’ n ) / 0. Given real numbers x 0 and 
y 0 , define Eq = (exp(xoyo))- Then det(£ , 0 ) > 0. Suppose det(£*_i) ^ 0 for all sets 
of real numbers {s t }£ _1 and {t,}£ _1 , where s 0 < «i < • • • < $fc-i and to < *i < * * * < 
tk- 1 - Given sets {xJq and {y,}£ satisfying x 0 < xj < • • • < x& and yo < yi < ‘ < yk, 

suppose det(-Ejt) = 0. Define u* by 


/ gXOVO 


ttfc(y) = det 


e *iyo 


e x°yi 

e riV, 


e xoyk-i e *oy \ 
e xiVk-i e T,y 


y e x kVO e XkVl . . . e XkVk-l e XkV J 


Then Uk has k + 1 distinct real zeros yo, yi, ■ • ■ , yjt- Expanding by the last column 
gives itfc(y) = £*= o a , eX ' y - By assumption, each a, ^ 0, so Uk can have at most k 
distinct real zeros. Therefore, det(£’^) cannot be zero. 

Now that we have det(£ n ) ^ 0, induction can be used to show that it must be 
positive. First, Eq > 0 for all real numbers xo and yo- Next, suppose det(£' l t_i) > 0 
for all matrices of order k of the form (6.2) where x 0 < X! < • • • < x*-i and y 0 < 
yi < • • • < yk- 1 - Let Xk be a real number and Xk > Xk-\ • For any real number y, the 
determinant 


det(£ fc (y)) = 


exp(x 0 y 0 ) exp(x 0 yi) 
ex P(xiy 0 ) exp(x 1 y 1 ) 


exp(x 0 yjt-i) exp(x 0 y) 
exp(xiy t _ 1 ) exp(x!y) 


exp(x fc y 0 ) exp(x fc yi) exp (x k yk-i) exp {x k y) 


can be evaluated by expanding by minors of the last column to get 

Jt 

det(£ fc (y)) = ^<z,exp(x,y), 

i=0 

where the coefficient a* is the determinant of a matrix of order k of the form (6.2) 
and therefore positive by assumption. For all y > y fc _ x , det(£ fc (y)) ^ 0. So, on the 
interval (yfc_i,oo), either det(£’ t (y)) > 0 or det(£ t (y)) < 0. Suppose det(£’ i (y)) < 0 
on (y k - 1 , oo). Then 

k - i 

exp(-x fc y)det(£jk(y)) = a k + £ a, exp[(x, - x^y] < 0 for all y > y k -\. 

i=0 
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But this is impossible since 

lim (a k + V a, exp[(x, - x*)y] ) = a k > 0 

tf -°° V ) 

So det(£fc(y)) > 0 for all y > y k -i. D 

Other examples of T-systems can be constructed by noting that if {u;}J is a 
T-system on [a, 6], the following hold. 

1. If the function r is continuous and positive on [a, 6], then {ruj}o is a T-system 
on [a, b\. 

2. If the function r is continuous and increasing on [c, d] and the range of r is [a, 6], 
then {«i o r}J} is a T-system on [c,d]. 

Since {exp“‘ f }2 is a T-system for q 0 < aj < • • • < a n and In* is continuous and 
increasing, (2) implies that {f OI, }o is a T-system on any closed subinterval of (0,oo). 
The next definition generalizes the concept of a polynomial. 

Definition 6.2 Let {u,}£ be a set of real-valued functions defined on the interval 
[a, 6]. A function of the form u = Y^?=o a i u ii where each a, is real, is called a u- 
polynomial. A u-polynomial is said to be nontrivial ifYl?= o a ' 2 

Example. A Dirichlet polynomial £”=o a.e 0 '* is a u-polynomial in the T-system 

If { u »}o a T-system, the functions u 0 ,ui, ■ • ■ ,u n are linearly independent, and 
therefore any u-polynomial, u = ^ =0 aiU t , is uniquely determined by the coefficient 
vector (ao,ai,-",a n )' Furthermore, a u-polynomial is determined by specifying its 
values at n-f 1 distinct points. In particular, any nontrivial u-polynomial has at most 
n distinct zeros. 

Definition 6.3 The number of distinct zeros on an interval [a, 6] of a continuous 
function f is denoted by Z(f). 

Example. Let Uo(t) = 1, and Ui(t) = V for i = 1, •••,«. As already mentioned, 
{u,}2 is a T-system on an interval [a, 6] and there exist real orthogonal polynomials 
<£ 0 , 4>\i • 4>n such that for i = 0, 1, • • • , n, the degree of <£, is equal to i. For each i, 

<fri has exactly i real distinct zeros, all in the interval (a,b). So Z(4 >, ) = i. 
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We know that if {uj}£ is a T-system, then Z(u) < n for any u-polynomial. The 
following theorem gives a sufficient condition for a set of functions to be a T-system, 
which depends on knowledge of the zeros of all u-polynomials. 

Theorem 6.2 //{«,}£ is a T-system, then Z(u) < n for every nontrivial u-polynomial 
u. Conversely, if a system {u,}g of continuous functions on [a, b] satisfies Z(u) < n 
for every nontrivial u-polynomial u, then {ui}[} is a T-system except possibly for the 
sign of one of the functions. 

Proof: Let {u;}J be a system of continuous functions on [a, 6] and Z(u) < n for 
any nontrivial u-polynomial u. Define the function P, where 

P : [a, b] n+1 C ! n+1 -» R and 

Uo(to) Uo(tl) ■■■ «0 (tn) 

Ul(to) Ui(ti) ■■■ Ui(t n ) 

P(t Ot ti, • • • , t n ) — 

Un(to) U n (*i) ••• U n (t n ) 

Then P is continuous on the region [a, fc] n+1 . If there exists a vector ( t 0 , <i, • • • , t n ) T 6 
[a, 6] n+1 such that P(t 0 , ft, • • • , t n ) = 0, then there would be a nonzero solution of 

t*o(*o) M*i) «o(<n) \ f 

Ui(f 0 ) Ui(ti) ••• U\ (t„ ) Ci 

Un(*o) «n(*l) ••• W n (<n) / 

For this nonzero solution, define 

u(t) = CoU 0 (t) + CiU^t) + • • • + c n u n (t). 

Then u is a u-polynomial with n + 1 zeros, t 0 , t\, • • • , t n . Since Z(u) < n for every non- 
trivial u-polynomial, P(to,t\, • • • ,< n ) must be nonzero for every choice (to,ti, ■ • • ,t n ) T € 
[a, 6] n+1 . By the intermediate value theorem, P{to,t\, •••,*„) must have a fixed sign 
in [a, 6] n+1 . □ 

Corollary 6.1 If {u ,}q is a CT-system, then for each k = 0, l.---,n, Z(u) < k , 
whenever u = I^=o a « u '> eac h a j is real, and £f_ 0 a, 2 > 0- 
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An important property of a polynomial is the multiplicity of each of its zeros. 
If the multiplicity of a real zero is even, then the polynomial does not change signs 
at that point. The following definition categorizes the zeros of continuous functions 
according to whether or not the function changes signs at that zero. 

Definition 6.4 For any continuous function f on [a, 6], an isolated zero to € ( a i^) 
of f is called a nonnodal zero provided f does not change sign at to. All other zeros 
including zeros at the end points a and b are called nodal zeros. 

Definition 6.5 The number of zeros on [a, 6] of a continuous function f , where nodal 
zeros are counted once and nonnodal zeros twice, is denoted by Z(f). 

Theorem 6.3 //{u,}2 is a T-system, then Z(u) < n for every nontrivial u-polynomial 
u. Conversely, if {u,}£ ,s a system of continuous functions on [a, 6], and Z(u) < n 
for every nontrivial u-polynomial u, then {u,}q 15 a T-system except possibly for the 
sign of one of the functions. 

Proof: The second part of the theorem follows from Theorem ( 6 . 2 ) since Z(u) < 
Z(u). To prove the first part, assume Z(u) > n + 1 for some nontrivial u-polynomial 
u. If u has no nonnodal zero, then Z(u ) = Z(u), and Theorem ( 6 . 2 ) would be 
contradicted. 

Assume u has at least one nonnodal zero. Let t lx t 2 , • ■ ■ , tk be distinct real zeros 
of u with *i < *2 < For each nonnodal zero t t , choose e, > 0 so that if 

i ^ Jfc, then U < U + e, < t i+ 1 , and if i = k, then t, < t, + e, < b. If t : is the 
first nonnodal zero (i.e., t 3 < t x for all nonnodal zeros i,), then choose e } > 0 so 
that if j ^ 1, then t : - 1 < t : — e r and if j = 1, then a < t 3 — e r Define the set 
5 = {s|s is a zero of u, or s = t{ ± e; for a nonnodal zero t, of u}. This set is finite 
and contains at least n + 2 elements. Label the elements of S = {so> 5 ii • • ‘ so 
that s, < Sk whenever i < k. Consider the points s 0 , Sj, • • ■ ,s n+ i. Then u(s,) > 0 for 
i odd and u(s^) < 0 for i even, or vice-versa. Also, 

u(s 0 ) u(si) ••• u(s n+ i) 

Uo(-So) Uo(Sl) ••• U 0 (s n+1 ) _ 0 

^n(-So) ^n(^l) ^n(-Sn+l) 
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because the first row is a linear combination of the following rows. If the above 
determinant is expanded by the first row, we have 

n+1 

a,u(sj) = 0. 

1=0 

Since {u,}J is a T-system, each minor is positive, and therefore the a, alternate in 
sign, as do the tt(s,). Either a,u(s,) > 0 for all i or a,u(s,) < 0 for all z, so aju(s,) 
must be zero for all i. Since each a, ^ 0, Si must be a zero of u for i = 0, 1 , • • • , n + 1 . 
But by theorem (6.2), Z(u ) < n, so the assumption that Z(u ) > n + 1 is false. □ 
In the proofs of some of the following theorems, the existence of a nonnegative 
u-polynomial which vanishes only at prescribed points will be assumed. The following 
theorem due to Krein guarantees the existence of such a function. Before stating the 
theorem, the following definition of a weight will be needed. 


Definition 6.6 Let T = {£j, •••,£*} be an increasing set of distinct points in (a, 6]. 
To each U G T a weight <*>(£,•) is defined by 


u;(t,) = 



ti e (a, b ) 

ti = a or b 


Theorem 6.4 (Krein) If {u,}£ is a T-system on [a, 6] and < n , then 

there exists a nontrivial, nonnegative u-polynomial u vanishing precisely at the points 
of T = {<i, • • * , tfc} . The only exception is that if n is even, and exactly one of the 
end points a or b is in T then u(t) may vanish at the other end point as well. 


Proof: First let n = 2m + 1 for m a positive integer. Suppose a < t\ < <2 < * • • < 
t k < b. Select points tfjf, - • - ,t m . k ' so that t k < tf < tf < • • • < t m - k < b. Let 
the set {sj}i m+1 consist of the points 

a i ^i> ^1 + £■) ^2 1 •••> t k , t k + e, t\ , tf + e, t m _ k ', t m - k + t (6.4) 

where t > 0 is chosen so that the above sequence is increasing and contained in [a, 6]. 
Define the polynomial 

, . / 0, 1, • • • , 2m, 2m + 1 \ 

ufit) = u I 

y S], S 2 , ) , S2m+l? t J 
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(6.5) 

Wo(-Sl) ••• «o( 5 2 m+l) “o(0 

^2m+l(^l) W2m+l( , S2m+l) ^2m + l(0 

Since {«,}£ is a T-system, u t (t) vanishes precisely on the set {-Si} 2m+1 , and each zero 
is nodal. In particular, 

M0>0 if s 2 .-i < t < s 2i (i = 1,2, — ,m) 
or s 2m +i <t <b. 

Expanding (6.5) by the last column gives 

u <(0 = ] C a *( e ) u «(*) 

i=0 

where K(e) — Hr = o[ a »( e )] 2 is positive. Define the function 

»(m) = 

v A (e ) ,=0 

= Z] where 6, -(f) = 

Then 5ir=o[M € )] 2 = 1 for each positive e suitably close to zero. 

Select a sequence {e^} of small positive numbers so that — + 0. Since |6,(ej)| < 1 

for i = 0, 1, • * *n and j a positive integer, then there exists a subsequence {e Jk } such 
that {6,(e Jk )} converges for each i = 0, 1, Let a 0 ,ai,---,a n be limits of the 

subsequences, where 

{ ( e ik ) } ~ 1 " a i i 

and define 

«(0 = X]«i«i(0- 
1=0 

Then £" =0 a, 2 = and fi vanishes at the points a,t u t 2 ,- ■ ■ y t k ,t x ',t 2 \ • • • ,<«-*'• All 
points except a are nonnodal zeros, so Z{u) > 2m + 1. By theorem (6.3), Z(u) < 
2m + 1, so these are all the zeros of u. 
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Next, construct in a similar manner a nonnegative u-polynomial u* that vanishes 
precisely at points t x , t 2 , •••, t k , ^ C-fc’ ^ where the points in {t"}™ 
are chosen so that {f'}™ - * D = 0- Then the u-polynomial u(£) = u(<) + tt “(0 

is nonnegative and vanishes precisely at points £ 1 , • • • , t*. 

In case {<!,••*,<*} is such that either ti = a or £* = but not both, then 
the desired u-polynomial can be constructed in a manner similar to the above. If 
t x = a and t k = b, select points t\, t' 2 , • • • , t' m _ k+l satisfying t k - 1 < £'• < t k for 
j = — Jb + 1. As before, choose e > 0 small enough that the sequence 

{si(c)}i m+1 , consisting of points 

a, + e, t 2 , t 2 + i + c, t\, t\ + e, "b e, 6, 

is increasing. Define u e (£) as before in (6.5). Letting e — » 0 as above, a nonnegative 
polynomial u is determined which vanishes at the points 1 1 , • • • , fit-i , t\ » • ■ • , £ m-fc+i » £*• 
Since ^ = a and t k = 6, Z(u) = 2m. Since, Z(u) < 2m + 1 by Theorem 6.3, u cannot 
vanish elsewhere. 

Next construct as before a nonnegative u-polynomial u‘ which vanishes precisely 
at the points ti,- • •,<fc_i,t / 1 / ,- •• where {£'-}™“* +1 fl {t"}?~ k+l =0- Then 

u(f) = u(<) + u“(£) is a nonnegative u-polynomial which vanishes precisely at the 
points fi, • * • ,t k - 

In case n is even, an argument similar to the above produces a nonnegative u- 
polynomial u vanishing at precisely the points T = {£j, • • • , £*} unless exactly one of 
the end points is in T. In that case the proof leaves open the possibility that u might 
also vanish at the other end point as well. D 

Theorem 6.5 Let {u,}£ be a T-system. Let T = {ti,t 2 , ■ ■ ■ ,t k ) be a set of points 
in [a, 6], and let u : T — ► {1,2} be such that u>(f,) = 1 if t, = a or t x = b. Suppose 
£}*_i <*>(£;) < n. Then there exists a u-polynomial u such that u(t) 0 fort € ( a,b) — T 
and such that £, is a nodal zero if u/(£,) = 1 and a nonnodal zero if jj(t t ) = 2. If n is 
odd, the polynomial vanishes precisely on the set T . 

In attempting to extend the method of Gaussian quadrature to functions other 
than polynomials, it will be helpful to define the moment space of a T-system. Its 
characterization depends upon a theorem of Caratheodory, the proof of which depends 
on the following theorem. (See [40].) 
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Theorem 6.6 Let A C R n . The convex hull of A is the set of all finite convex 
combinations of elements of A. (A convex combination of points Ui,« 2 , • • • in A 
is a linear combination of the form £?=i tiUi, where U > 0 for all i = 1,2, • • • ,p, and 

ELi = 

Proof: Let B be the set of all convex combinations of elements of A. Denote the 
convex hull of A by C{A). Since C(A ) is a convex set containing A, it contains all 
finite convex combinations of elements of A. So B C C(A). 

Suppose x and y are elements of B. Then x and y have representations of the 
form 

n m 

x = and y — 51 

i=l i=l 

where A, > 0 for all i = 1, •••,«, pi > 0 for all i = l,---,m, E"=i = an< ^ 
ET=i t*i = 1- Let t € [0, 1], Then (1 - t) € [0, 1] and 

n m 

tx + (1 - t)y = + ^((1 - t)p t ]y, 

;=i «=i 

is a convex combination of elements of A. So B is a convex set, and B contains A. 
This gives C{A) C B and therefore C(A) = B. E 

The proof of Caratheodory’s theorem is given in [40], and it refers to the following 
theorem which is easily verified and therefore stated without proof. 

Theorem 6.7 Let Xj, • • • ,x p be points in V , where V is a linear space over R. The 
following statements are equivalent: 

1. For any j , the vectors Xj — Xj, i ^ j , are linearly independent. 

2. If ai, - ,a p are real numbers such that 

p p 

T, QjX, = 0 and y a, = 0 

i=i t=i 

then Qj = «2 = ■ • • = = 0. 

Theorem 6.8 (Caratheodory) Every point belonging to the convex hull of a given 
set A in R n can be represented as a convex combination involving at most n + 1 points 
of A. 
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Proof: Let x belong to the convex hull of A. Then by Theorem 6.6, there exist 
x l5 * • • i x p € A and A 1? • • • , X p € R such that A, > 0 for i = 1, • • • ,p, A, = 1, and 

p 

X = Y^ A ,x, . 

i=i 

Suppose there exist p\, • • • ,p p € R, not all zero, such that 

p p 

/*.*. = 0 and ]T fii = 0. 

i=l i=l 

Then for every p € IR, 

x = Y1 ^ x * - pYj a*«*» = ( 6 - 6 ) 

t=i «=i i=i 

Define the set 

5 = {cr 6 1 | a pi < X i for 1 < i < p}. 

Then S is a closed interval of the form [a,/?], as the following argument shows. For 
each i such that pi 0, o must be in the interval [A,//ii,oo) if pi < 0 or (— 00 , A « / /<«] 
if pi > 0. Denote the appropriate interval by f. Then a belongs to the intersection 
of all the intervals /, which correspond to /i, ^ 0. Since the p x are not all zero and 
£f_i Pi = 0, there exist integers m and n in {1, • • • ,p} such that p m < 0 and p n > 0. 
So the intersection is a closed bounded interval [a, /?], where a = X 3 f p : and j3 = \k/ Pk 
for some j and Ar in { 1 , • • • , p} . 

Substituting a for p in the above representation (6.6) for x gives 

p 

x = ^(A, - ap,)x„ 

1=1 

where (A, — ap t ) > 0 for i = and £f =1 (A; — otpi) = 1. Since a = X 3 fp } , 

then Xj — apj = 0. Thus x is a convex combination of p — 1 points of A. Rename 
these p — 1 points x l5 • • • , x p _i. If there exist p,, • • • ,p v ~\ € R, not all zero, such that 
piX{ = 0 and Pi = 0> then the above process can be repeated to produce 

a representation of x as a convex combination of p — 2 points of A. Continuing this 
will eventually produce a representation of x as a convex combination of q points 
of A , where q < n + l. To see this, suppose there exists a representation of x 
as a convex combination of q points Xi,***,x 9 for q > n + 1. Suppose also that 
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whenever pi, • • • , p q are real numbers such that £Li AW = ® anc * £?=i P* — ^" en 
p x = p 2 = ••• = p q = 0. Then for any j, the set of q - 1 vectors x, - x jy for 
i ^ ji is linearly independent. But q — 1 > n, contradicting the fact that any linearly 
independent set in lR n contains at most n vectors. □ 

Before defining the moment space, it will be useful to state the following definition 
and theorems due to E. Helly. Proofs of the theorems are given in [24]. 

Definition 6.7 A set 7 of functions is said to be uniformly of bounded variation on 
[a, 6] if there exists a constant M such that V(f) = /* |d/| < M for all f € 7 ■ 

Theorem 6.9 (Helly’s Selection Principle) Let {0m,n} be a double sequence of 
real numbers which is bounded by A; i.e., 


|0 m ,n| < A for all m, n. 

Then there exists a subsequence {n^} and a sequence {0 m } of real numbers such that 
for every positive integer m, 

lim @m,nk — 

fc— * oo 

Theorem 6.10 (Helly) If {er n } is a sequence of functions, uniformly of bounded 
variation on [a, 6] such that a n (a) is bounded for all n, then there exists a subsequence 
{<r nk } and a function a of bounded variation such that lim^—oo <7 n *( x ) =: a i x ) f or a ^ 
x in [a, i] . 


Definition 6.8 Let {u,}q be a T-system on the interval [a, 6]. The moment space 
M n +i with respect to is defined to be the set 


M 


n+l 


{c= (CD, c,, S £” +1 I c, = fu,(t) da[t),i = 0,1, 

J a 

and o is a nondecreasing function of bounded variation 
which is right continuous on (a, 6)}. 


• ,n, 


Theorem 6.11 The moment space M. n +\ is a closed convex cone. 


Proof: If A is a positive real number and c is an element of A4 n+1 , then Ac also 
belongs to M n+ j. So M n +i is a cone. Also, for any c t and c 2 in M n+ i, tc x + (1 — t)c 2 
belongs to M n +i for all t satisfyng 0 < t < 1. So M n + i is convex. 
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To show that _A4 n +i is closed, let u(t) = 5Z” = o a » u »(0 denote a strictly posi- 
tive u-polynomial. The existence of such a polynomial can be demonstrated us- 
ing Theorem 6.4. (To construct one, find two nonnegative u-polynomials that have 
no common zeros. Then their sum is a strictly positive u-polynomial.) Let c = 
(co, •••,c n ) £ E n+l and {c^} a sequence in Af n +i with lim r _oo c (r ^ = c. For each r, 
c (r) = (£\c{ r \ • , 4 r) ), where 

c, (r) = / Ui(t)da r (t), 

Ja 

and a T is a nondecreasing function of bounded variation which is right continuous on 
(a, 6). For convenience, a T can be chosen so that <r r (a) = 0 since / a 6 / dcr = f a f d(a + 
K ) for every function a of bounded variation and every constant K. Since {c* r *} is 
convergent, the sequence {H” = c> a J c !^} i s bounded. So for some constant M , 

M > y, o,c- r) = f u(t)da T {t) > ( min u(r)) f*da r (t), 

“ Q Ja \a<r<b / 

where min a < r <(, u(r) > 0. Since a T is nondecreasing, the variation of <r r on [a, 6] is 
f*d(T r (t). Therefore, the sequence {cr T } is uniformly of bounded variation on (a, 6]. 
By Helly’s theorem, there exists a subsequence {cr rjlc } and a function a of bounded 

variation such that lim* oo cr rfc (f) = cr(t) for every t in [a, b]. Then for each i = 

0, 1, • ■ • ,n, the continuity of u t on [a, 6] implies 

c, = lim f*Ui{t)da rk (t) = f^Ui{t)da(t). 

k — * OC 

For a proof of this equality, see [24] or [38]. Since each cr rk is nondecreasing, and 
a Tk — ► <t, then a is also nondecreasing. Since a is of bounded variation on [a, 6], it has 
at most a countable number of discontinuities, lim 4 _ x - <r(t) exists for each x € (a, 6], 
and lim f _ x + a(t) exists for each x € [a, b). (For verification of this, see [38].) Define 
the function cr 0 by cr 0 (a) = <j(a), cr 0 (b ) = er(6), and &o{x) = lim ( _ x 4 cr(t) for all t in 
(a, 6). Then a = cr 0 almost everywhere in [a, 6], and a Q is right continuous on (a, b) 
and nondecreasing on [a, 6]. So cr 0 belongs to M n + i, and 

Ci = fa'll, dcr = faii,dcr 0 

for each i = 0, 1, • • • , n. Therefore c £ M . n+J . O 

Another characterization of M n + i results from the following definition. 
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Definition 6.9 Let {«,}” be a T -system on [a, b] and C n+l the curve in £ n+1 defined 
by 

C n +i = { 7(0 = {uo{t),ui(t),- • • ,u n (t)) | a < t< 6}. 

Let C(C n + 1) denote the smallest convex cone containing C n + i- 


Theorem 6.12 C(C n + 1) is closed and every 7 = ( 70 ,- •* > 7 n) € C(C n+ 1) can be rep- 
resented in the form 


n +2 

7 » = ^ ), I = 0, 


( 6 . 7 ) 


i - 1 


where for each j — 1 , • • • , n + 2, A ; > 0, and a < t : < b. 


Proof: Clearly, all vectors of the form ( 6 . 7 ) belong to C(C„+i). Since C(C n+ i) 
is convex, it must contain the convex hull of C n +i . In fact, C(C n+1 ) — I k — 
0 and fi belongs to the convex hull of C„+i}. Let 7 G C(C n + 1). Then 7 = W for 
some it > 0 and ft in the convex hull of C n+1 . By the theorem of Caratheodory, ft can 
be represented in the form fix — Q j u i{lj)i where Qj > 0 and a < t } < b. Then 7 
has the representation 7, = T.]=i ka : Ui(t : ), where ka j > 0 . 

To show that C(C„+i) is closed, suppose lim r _oo7 (r) = 7 * where 7 (r) € C(C n +i) 
for every r. If 7^ = (7^, * • • , 7i r ') anc ^ 7 = (7o, • • * ■> 7n)> then for each i = 0 , 1 , • • • ,n, 

n+2 

7, <r) = J2 A l r,u >( < j r) ) and r 1 “2,T! r) = 7n 

J=1 

where for every r, A^ ^ ^ 0 and a < t^j ^ < b. As seen before, there exists a strictly 
positive u-polynomial u(t) = £”_ 0 a i u i{t)- Since {7^} is convergent, there exists A/, 
such that for every positive integer r, 

M > X>, 7,' r| 

1=0 

n /n+2 \ 

= E-dL^r) 

«=o \j=i / 

n+2 / n \ 

= E»rz«(i!") 

J- 1 \ »=0 / 

71+2 / \ 

- g^ r) C<^ u(r) )- 
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So {A^} is uniformly bounded by M . Since each u, is continuous on [a, 6], {|u,-(fj ^)|} 
is uniformly bounded. 

For each positive integer r, consider the finite sequence 

Rn.r} = 

«o(4 r) )> * • * . «o(*£+a)» 
u l( <( i r) ) 5 Wx(4 r) ), * • ’ , «l(4+2)’ 

U n (^i r) ), W n (4 r) ), • • • ,«n(4+2)}- 

Then {|^ m ,r|} is uniformly bounded, and Helly’s selection principle guarantees the 
existence of a subsequence {r^} and sequences { Ay } and {u tJ } such that 

lim Aj r ** = A j j = l,*--,n+2 and 

k-+oc J 

lim Ui{t ( - k) ) = Uij i — 0, • • • ,n and j = 1, • • • , n + 2. 

Since Aj r *^ > 0 for all j and r*,, then A j > 0. Since each Ui is continuous on a closed 
interval, the range is closed, and therefore u t] is in the range of u,. So for each i and 
j , there exists at least one t’ £ [a, 6] such that 

lim „,((''*>) = u,(f). 

fc— OO J J 

This gives 7 = lim r _oo 7^ = (70, • • • , 7„) where for each i, 

n+2 

li = Jim 7, (r * ) = 

Ac— 00 * — : J 

So 7 £ C(C n +i), and C(C n+ 1) is closed. □ 

In the proofs of some of the following theorems, some properties of hyperplanes 
will be useful. For convenience, the related definitions and theorems will be stated 
here. Proofs can be found in [40]. 

Definition 6.10 A hyperplane H in E n is determined by a nonzero linear functional 
f:E n ^R, where f(x 1, • • • ,x n ) = £" =1 a,x; for real constants a 1( • • • ,a n , and a real 
constant d. H is defined by 

H = {veE"\Hv) = d}=f-'(d). 
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Let A,BC E n . Then H is said to separate A and B if either f(A) < d and f(B) > d, 
or f(A) > d and f{B) < d. H is said to separate A and B strictly if either f{A)<d 
and f(B) > d or f(A) > d and f(B) < d. 

Definition 6.11 Let V C E n and c an element of the boundary ofV.A hyperplane 
H = f~ l {d) in E n is said to be a supporting hyperplane for V atcifc^H and either 
f(V) > d or f(V) < d. A supporting hyperplane H for V is said to be non-trivial if 
V is not contained in H . 

Theorem 6.13 Let V be a convex subset of E n and c an element of the boundary of 
V. Then there exists a non-trivial supporting hyperplane for V at c. 

Theorem 6.14 Let V C E n be closed, convex, and non-empty. Let c € E n and 
c<£V. Then there exists a hyperplane in E n separating V and c strictly. 

With these theorems, it can now be shown that Af„+i and C(C n + i) are identical. 

Theorem 6.15 A4 n +i = C{C n + 1 ) = the convex conical hull of C n+ 1 - 

Proof: If 7 € C(C n + 1), then 7 = (70, • • • , 7 n) can be represented in the form (6.7). 
So for a given integer i, 7, is given by 

n +2 
7 , = Y , 

j=l 

where t x < t 2 < • • • < t n+2 , a < t s < b, and > 0. There exists a nondecreasing 
step function a which is right continuous on (a, 6) and has its only points of increase 
at the points t 3 and for which 

n+2 

Y AjUi( tj) = f a b u,(t)d(r(t). 
j = 1 

So 7 e Mn+ 1- 

Next, suppose c° = (cj 0 , • • • , c n °) € M n +\ but c° £ C(C n+ i). Since C(C n +\) is a 
closed convex cone, Theorem 6.14 guarantees the existence of a hyperplane strictly 
separating c° from C(C n + 1)- So there exist real constants a 0 , ■ • ■ ,a n , not all zero, and 
a real constant d such that 

Y a,c > > + d < 0, 

t=0 
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and for every 7 = (70, * * * > In) € C(C n+ 1), 

a, 7, + d > 0. 

1=0 

For each nonnegative A, (Au 0 (f), • • • , A u n (t)) G C{C n +\) for all t G [a, b}. Therefore 

jraMi(t)+d>o. ( 6 - 8 ) 


i=0 


Since c° G M n +\ , there exists a nondecreasing function <7° which is right continuous 
on (a, 6) and satisfies 


0 > ^2 a t c° + d 

t=0 

= ^2<itSa u i{t)dcr 0 {t) + d 


(6.9) 


1=0 


= fa a <*iUi(t) J da°(t) + d . 


^i=0 


Denote f b da°(t) by A. Since c° # C(C n+l ), c° ± (0,--,0). So A > 0, and integrating 
(6.8) with respect to <7° gives 

0 < / a 6 fiS Au >(o + ^W o (o 


1=0 

n 


= /.ME a,Au,(0 ^°(0 + /^°(0 


^1=0 


= H 


da 0 


(t) + A d. 


Dividing by A gives 


0 ^ fa (X^ a i u >(0 ) dcr°(t) + d. 


\i = 0 


This contradicts (6.9), so c° G C(C n +i), and .M„+i — C(C n + 1). 


Definition 6.12 The index 7(c) of a point c in M n +\ is defined to be the minimal 
number of points of C„+ 1 that can be used in a convex representation of c under 
the special convention that (uo(fl), txi(a), • • • , u n (a)) and (uq(£>), Uj( 6), • • • , u n (6)) are 
counted as half points while (u 0 (i), Ui(t), * • • , u„(f)) receives a full count for a < t < b. 
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Theorem 6.16 A vector c° € M n +u c°^0,isa boundary point o/M n + 1 'f and onl y 
if I(c°) < (n+l)/2. Moreover, every boundary point c° admits a unique representation 

«° = * = 0 , 1 , • • 4 ( 6 - 10 ) 
j= i 

where p < A, > 0, j = 1,2, •••,?, and t x < t 2 < ■ ■ • < t p . 

Proof: First, we let c° ^ 0 be a boundary point of M n + i an d s h° w t * iat ^( c °) < 
a±l. Since M n+1 is closed, c° € M n+ „ and by Theorem 6.13, there exists a support- 
ing hyperplane to A4 n+ i at c°. That is, there exist real constants a 0 , • • • ,a n , not all 
zero, and a real constant d for which 

^Pa;Ci + d>0 for all c = (co, • • • ,c„) € Af n +i (6-1 1) 

«=o 

and £a iCi ° + d = 0 l° r c ° = ( c o°» ” ‘ i Cn °) (6 ' 12) 

1=0 

Since (0, •■•,()) € M n + u (6.11) gives d > 0. Suppose d > 0. Then (6.12) implies 
a{Ci Q < 0. So there exists a positive real number A such that 

x (£, a < c ’°)+ d<o - (6 ’ 13) 
But Ac 0 € Af n +i for each positive real number A, so by (6.11), 

£r=o«.(Ac,°) + d>0 

or A (i;”-o o«Ci°) + d > 0 


This contradicts (6.13), so d = 0, and we have 


H?=o a « c ' — 0 f° r a ii c € 1 , and 

Er= 0 a,c, o = o. 


(6.14) 


Define the function u° by u°(<) = E"=o a . u »(0- For each 1 e ( a ’ ( u o(0> * ’ ’ » u "(0) € 

M n +i- To see this, choose the step function a t which is right-continuous and has its 

only point of increase at t of magnitude 1. Then for each i, 


Ui(t) = ;‘u i(r)<f<7,(r). 
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So u°(f) > 0 by (6.14). Since c° G M n+ i, there exists a function <7° such that 
c { 0 = faUi(t)da°(t) for each i. Then faU°{t)da°(t) = 0. Since u°(t) > 0 for all 
t G [a, b] and <7° is nondecreasing, then u° vanishes at every point of increase of a 0 . 
So < 7 ° can only have a point of increase at a zero of u°. By Theorem 6.2, the number 
of zeros of u° is less than or equal to n, and they will be denoted by ti,t 2 > ' ' ‘ 
where k < n. Then for some positive constants Aj, • • • , A*, 

Mt)dc 7°W = EV.(«;) 

j=i 

for each i. So c° has the representation 

c° = ^A i (u 0 (^),---,u n (t J )). 
j = i 

Since u°(f) > 0 on [a, 6], all zeros except a and b are nonnodal. So 27(c) < Z{u°). 
(See Definition 6.5.) By Theorem 6.3, Z(u° ) < n, so 7(c) < 

To show that the representation (6.10) is unique, let T = {rjjJJ be a set of distinct 
points in [a, 6] containing all the zeros of u°(t). If c° has a representation of the form 
(6.10), then there exists a function a 1 such that c° = (/a«o(0^ <rl (0> ‘ * ‘ 
and the points of increase of a 1 are zeros of u°(t). The system 

C0° = Ad^O^o) + • ' ■ + ^nM r n) 

C„° = l3 0 U n {T 0 ) + • • • + /?n u n( T n) 

has a unique solution for fa, • • • ,/3 n since the determinant of coefficients is nonzero. 

Now let c c denote a vector of A4 n+ i for which 7(c°) < !i y i . Then c° = (co°, • • • , c„°) 
has the representation Cj° = Y%=\ AjUi(tj), where Yl P j= i w (^j) = 27(c°) < n + 1. (See 
Definition 6.6.) Then ^ =1 u;(^) < n and, by Theorem 6.4, there exists a nontrivial, 
nonnegative u-polynomial u°(<) = a,u,(f) for which u°(tj) = 0 for each t y If 

c = (co, • • • , c n ) G C(C n + 1 ), then by Theorem 6.12, c has the representation 

C n+2 n+2 

j=i j= i 
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where each Sj > 0. Then 

n+2 n + 2 

a 0 Co + ha^Cn = a 0 £ ^ u o( r i) + ' ' ‘ + ° n X] 

i - 1 J =1 

= ^i(a 0 n 0 (Ti) + --- + a„u„(r 1 )) + ••• 4- 

^n+2( a O u o( T n+2) 4" ’ * * 4" Qn u n{ T n+2 )) ^ 0* 

Also, 

v P 

a 0 Co° + H a nC n ° = a O ^J U o( < i) + Z- 

j=l j=l 

= Aj [do^o(^l ) 4" * ' ‘ 4" On^n(^l)] 4" ' ' ■ 4* 

A p[a 0 u 0 (fp) 4- • • ■ 4- a n «n(<p)] = 0. 

So the points satisfying a 0 Co + - ■ - + a n c n = 0 form the supporting hyperplane to M n +i 
at c°. Therefore, c° is on the boundary of M n + i- D 

Before looking at interior points of M n + 1 , it is helpful to define a section of M n+ j. 

Definition 6.13 A section o/ .M n+1 is any subset S of M n +i tfie following 

properties: 

1. S is contained in a hyperplane. 

2. Ifc° e M n + 1 and c° ^ 0, then there exists a unique positive real number A such 
that Ac 0 6 S. 

Theorem 6.17 S is a section in A4 n +i if and only if there exist real constants 

a 0 , • ’ • ,a n and a positive constant a such that u(t) = E? =o a i u i(0 > 0 f or a ^ ^ ( a ’ 

and n 

S = {(Co, • • • , c n ) G Mn+l I a * C « = Q ) ^ 6 - 15 ^ 

1=0 

Proof: First, suppose 5 = {(co, •••,c n ) € M n +i | o a « c » = a}, where ]C, =0 a i u M) > 
0 for all t € [a, 6], and a > 0. Let c € S. Then clearly c belongs to the hyperplane 

defined by £"=o a *' x » = Q * 

Let c = (co, • • • , Cn) e M n +i and c ^ 0. By theorem (6.S) c has the representation 

( n+2 n+2 

yi AjUo(tj), • • • , Aj' u n(^j) 
i=i j=i 
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where X 3 > 0 and a < t 3 < b for j = 1, • • • , *» + 2 - So 


n+2 


fltCt = X] ( a ' 


*=o 


i=0 


j=l 


n+2 / n \ 

= 

j=l \ i=0 / 


Since «(t) > 0 for nil f € (a, »], ZU > »• Since c * °’ we hiVe A ' * ° 

some ; and therefore £“ =0 e.c, > 0. So there exists a unique positive constant A such 

that Er=o fli(Aci) = a, and Ac 6 5. This shows that 5 is a section. 

Now suppose 5 is a section in AW Then 5 lies in a hyperplane defined by 
£« =0 a, x , = a, where a > 0 and E?=o <*.' 2 > The f ollowin 8 argument shows that 
a > 0 * Suppose a = 0, and let c G M n+1 . Then there exists a positive real number A 
such that Ac € 5. If c = (co, • • • , c n ), then a, Ac, = 0 and therefore £"=0 a,c, - 0. 

For every t € [a, 6], MO. * • * .««(0) € M n+l and thus Ef =o <W0 s 0 on [a, 6] By 
Theorem 6.2, the number of distinct zeros of a non-trivial u-polynomial is less than 
or equal to n. So Zi-o*? = 0, which contradicts the hypothesis that Er=o a . 2 > °- 
So a > 0. For any t G [a, 6], there exists A > 0 such that A(u 0 (t),-- • » u »(0) ^ $■ 

Since S hes in the hyperplane defined by E?=o a « z » = a > we ^ ave £«=o a *^ u '(0 ” Q 
and thus E?=o a t u t (t) > 0- Let c = (co, • • • , c n ) be any element of M n +> for which 
£n_ o a tCx - a . Since 5 is a section, there exists A 6 1 such that Ac 6 S. Then 

i:r=o o.Ac, = a and therefore A = 1. So c € 5, and 

n 

{ (Cq, • • • , C n ) € Mn+l | E a * C « = C 


i=0 


Therefore 5 = {(co, • * • , c„) G M n +\ I E"=o a < c < ~~ a )‘ 

Theorem 6.18 Let S be a section in M n +\- Then S is convex and bounded. 


□ 


Proof: If 5 is a section in A4 n +i, then by the previous theorem, there exist 
real constants a„, ••■,<>„ such that u(t) = £? = „aiu,(i) > 0 for all t 6 [o,6], and 

S = {(co,---,c„) € Af n +i I ESrtttiCi = ^ Since > 0 on 

min a <i<(, u(() = M for some positive number M. Let c = (co,---,c„) 6 5. Then 
c S At„+i, and there exists a right continuous nondecreasing function a such that 
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d = f*Ui(t)dcr(t ) for each i. So 

n 

Q = 

i =0 

= i: 

t =0 

= /a ^ 

> Mfcda{t). 

So f*d(i(t) < ct/M. Since u, is continuous on [a, 6], it is bounded there, so 

|Ci| = |/‘uj(()<f<x(l)| 

< ;‘K(0I^(1) 

. Of 

< max ud<) — . 
a<t<b ' M 

So S is bounded. 

To show that S is convex, let u and v belong to S, and let t £ [0, 1]. By Theorem 
6.11, tu + (1 — t)v £ A4„ + i- By Theorem 6.17, there exist real constants a 0 , • • • ,d n 
and a positive constant a such that 

n 

S = {(Co, • • • , C n ) € A4 n+1 I Y. a ' c ' = 

i =0 

Let u = (u 0 , •••,«„) and v = (u 0 , • • ■ , u n ). Then 

n n 

Y a,Ui = a = Y a ' v i- 
i = 0 1=0 

So 

n n 

O = t Y a i U i + (1 — t) Y a « u » 

t =0 i =0 

n 

= Y a i( tu i + (1 “ 0 U »), 

i=0 

and therefore tu + (1 - t)v belongs to 5. D 
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Definition 6.14 Let c = (cq, • • • , c n ) € M n+l . The values {/,} in the representation 

P 

* = 0,1, •••,»*; \ } > 0, 1 <j<P 
j=i 

are called roots of the representation. If t m is a root, we say that the representation 
involves t m . The nondecreasing function a which concentrates all its increase at the 
points of Ml with respective weights { A ; } is referred to as the associated measure 
of the representation. (Note that c, = Ui(t)da(t) = Tl P j=i Ajt*«(fj)0 The index of the 
set T = {<!,•• • ,t p } is defined to be that number obtained by counting interior points 
as one and the end points a and b as one half. The index of the representation of c 
and the index of the measure a generating c are each defined to be the index of the 
set of roots of the representation. 

Note that, if a representation of c £ M n +i involves the minimum number of 
points, then the index of the representation is the same as the index of c, /(c), in 
Definition 6.12. 

Theorem 6.19 Let c = (cq, •••,c„) be an interior point of A4 n+1 . For each t~ £ 
[a, 6], there exists a representation 

p 

c, = for i = 0,1,* --,n; Xj > 0; 

j - 1 

j = 1,2, •••,?; t" € • • • , t p } 

of index (n + l)/2 or ( n + 2)/2. 

Proof: Let c be an interior point of Ni n +\ and t" £ [<*.&]. Suppose there exists 
A > 0 such that c— A(it 0 (<"), •• • ,u n (t x )). If n = 1 and either t M = a or t“ — b , then 
the index of c, /(c), would be A. But by Theorem 6.16, c would be a boundary point. 
So t' € (a, 6), and c has a representation of index 1. In this case the conclusion of 
the theorem is satisfied. If n > 2, then the index of c, /(c), satisfies /(c) < 1 < 
Again by Theorem 6.16, c would be a boundary point. So there does not exist A > 0 
such that c = A(u 0 (^*)i ■ • • , u n (f*)) in case n > 2. 

Consider a section S € A4 n+ i containing c. Let c* = A(u 0 (C), • • • ,u n (<*)), where 
A > 0 and c* € S. In case n = 1 and c* = c, the theorem holds, so assume c ^ c*. In 
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case n > 2, it is necessary that c* ± c. Since S is bounded, the ray c' + ^c-c"), t > 0, 
lies in 5 and pierces the boundary of M n +\ at a point c = c m + t(c — c ), where r > 1. 

So c = ac + (1 - a)c* for a = 1/r e (0, 1). 

If /(c) < then 7(c) < + 1, and by Theorem 6.16, c would be a boundary 

point of A4 n +i- So 7(c) > Sf 1 , and we have < 7(c) < Then 7(c) = ^ or 

а, and therefore 7(c) = or Since q and (1 — a) are both positive, Theorem 

б. 16 guarantees that c has a representation 

a = 

j = 1 

where i = 0, • • • , n, and \j > 0 for each j — 1, • • • , p. ^ 

Definition 6.15 Let c be an interior point of M n +i ■ A representation for c of index 
/( c ) = (n + l)/2 is called principal, and any representation of index 1(c) < (n + 2)/2 
is called canonical. A canonical or principal representation is designated upper if it 
involves the end point b and lower if it does not. 

Theorem 6.20 Let c be an interior point of M n + i f or n > 2. There exist at least 
two principal representations. If n = 2m, one representation is found by prescribing 
t* = a in Theorem 6.19, and the other is found by prescribing t" = b. The roots, 
respectively, are 

a = t\‘ < ^ 2 * < * ' ■ < tm+i* < ^ 
a < Si* < s 2 m < ■ ■ • < s m+ T = b 

If n = 2m + 1, one principal representation is found by prescribing t" — a to get roots 

a = sj* < s 2 ’ < ■ • • < 5 m+2 “ = b 
The other principal representation has roots 

a < ti m < t 2 m < • • • < *m+r < b 

Proof: Let c be an interior point of M n +\- First suppose n = 2m for some 
positive integer m. By Theorem 6.19, there exists a representation of index ^ or ^ 
involving t“ = a. In the proof of the theorem, it was shown that c has a representation 
of the form c = etc + (1 — o)c*, where 0 < a < 1, c* = A(tio( a )i ■ ■ * » u n( a )) f° r some 
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A > 0, and c is on the boundary of Ai n +i- Since 1(c) < = m + |, there exists a 

representation of c of index less than or equal to m. If the representation of c is less 
than m, then the index of the representation c = ac+(l — a)c* is less than m + But 
this would mean c is a boundary point of M n +i- So 1(c) = m. The representation 
of c does not involve a because if it did, the index of the representation of c would 
be equal to the index of the representation of the boundary point c. Since m is an 
integer, the representation of c cannot involve b. So the representation of c involving 
a has index m + ^ and thus is principal. The representation does not involve b. A 
second principal representation not involving a is determined by letting t“ = b. 

Now let n = 2m + 1 for a positive integer m. As above, let t m — a, c* = 
A(u 0 (a), • • ■ , tt n (a)) with A > 0, and c = ac + (1 — a)c m for 0 < a < 1. Then 
1(c) < So 1(c) < m + j. If 1(c) < m + then the given representation for c 

would have index less than m + 1, contradicting the fact that c is an interior point. 
So 1(c) = m + x, and the representation must involve either a or b. It cannot involve 
a because, in that case, the representation of the interior point c would have the same 
index as that of the boundary point c. Thus the representation of c involves both a 
and b and has index m + 1 = So it is principal. 

To construct another principal representation define the set 

C n +i(d) = {(Ui(t), ■ • • , u„(f )) | d < t < b) 

where d € (a,&)- Let M n +\(d) be the convex cone spanned by the curve C n+ \(d). 
First, we show there exists d! > a such that c belongs to the boundary of M n +\ (d 1 ). 
Suppose no such d' exists. Define the sets Ai and A 2 by 

A\ — {d | c is an interior point of M. n +.(<*)} 

A, = {d \ c e 

The set A\ is not empty. To see this, note that since c is an interior point of A4 n +i, 
Theorem 6.20 guarantees it has a principal representation not involving a. Let t m be 
the smallest root, and choose d to satisfy a < d < t m . Then c is an interior point of 
A / f n+ i(d) because the index of c with respect to [a, 6] is the same as the index with 
respect to [d, 6]. 

The set A 2 can be shown to be nonempty by considering separately the cases 
where n is even and where n is odd. First, choose a principal representation of c with 



47 


respect to [a, 6] which does not involve b. Denoting (u 0 (<), • • • , u n (t)) by U(t), c can 
be expressed as 

c = EW 

j=i 

where each A j is positive. Let tp be the largest root, and choose d — t p . 

First, suppose n — 2 m for a positive integer m. Then the above principal rep- 
resentation involves the endpoint a, and p = m + 1 - If c € A4 n+ i(d), then it has a 
representation involving d of index less than or equal to 2 f 1 . Let 

C = 52ljU(sj), 

J=1 

where s a = d, q < m + 1 , and each 7 j is positive. Then 


0. 


Since t p = s 1? we have 

= 0, 

j=i 

where {r x , • • • , r fc } is a set of k distinct points in [a, 6], and fc< 2 m + l = n+ l. Since 
{tio(t), • • • ,«n( 0 ) is a T-System, this would mean each a 3 = 0 for j = 1 , • • • , k. But 
this contradicts the assumption that each A ; in the principal representation of c with 
respect to [a, 6] is positive. 

Now suppose n = 2m + 1 for m a positive integer. Since the representation of c 
does not involve b, then by Theorem 6.20, it does not involve a. So p = m + 1 . If 
c € A4 n +i (d), then c has a representation in A4 n+ i(d) of index less than or equal to 
m + 1 . If the index is equal to m + 1 , then c is an interior point of M n +i{d), and 
the representation can be selected so that neither d nor b is involved. So c has a 
representation 

c=E*), 

j=i 


where q < m + 1 . Then 


= ewi-e*) 


= E>yl/(r ,). 


j=l 


0 
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where A:<2(m + l)=n + l. As before a contradiction is obtained because 
{u 0 (<), • ■ • , it n (£)} is a T-System. So both A x and A 2 are nonempty, and they can 
be shown to be open. Suppose c does not belong to the boundary of A4 n +i(<i) for any 
d > a. Then A\ and A 2 provide a decomposition of the interval (a, b) into the disjoint 
union of open sets. But this contradicts the fact that (a, b) is connected. Therefore, 
there exists d' G (a, b) such that c belongs to the boundary of Ad n +;(d / ). By Theorem 
6 . 16 , since c is a boundary point of M. n +\{d!), the index of c related to [d',6] is less 
than or equal to m -f If the index were less than or equal to m, then the index with 
respect to [a, 6] would be less than or equal to m + \- But this would contradict the 
fact that c is an interior point of Ai n +\. So the index of the representation related 
to [d', 6] must be m + 1 . If b is involved, then since m is a positive integer, d' is not 
involved. In this case, the index relative to [a, b] would also be m + 5, contradicting 
the fact that c is an interior point of A 4 „+i. So b is not involved and therefore d' 
must be involved. This representation has m + 1 roots all belonging to (a, 6) and by 
definition is principal. D 

Theorem 6.21 Let c = (cq, • ■ • ,c„) be an interior point of A 4 n +j and let er* and a 
represent two different measures satisfying f pidcr* = / p,d <7 = c, (i = 0 , 
where <7 has index or 2 ± 2 . Let T = {£1, £ 2 , ■ • • , t p } be the roots of a, where 
ti < £2 < • • • < t p . Then for every pair of roots tj and 1 of a lying in the open 
interval (a, b), there exists a point of increase of a* in the open interval (tj, tj+i)- V 
a has index this remains true if t } = a, or tj +l = b. 

Proof: First, note that since a and a* are measures, then they are right continuous 
non-decreasing functions which concentrate all their increase at their roots. Let tj 
and t 1+ 1 be consecutive roots of a. Note that the existence of two distinct roots 
implies that n > 1. If the index of a is !L ±l i assume these roots belong to the interval 
( a,b ). In this case n must be greater than 1, for if n = 1, then the index of a would 
be 1^. This would mean there are only two roots, and one is an endpoint. So there 
are not two distinct roots in (a, b). If the index of a is one of the roots can be 
an endpoint. 

Suppose cr* does not increase in the interval (tj,t ]+x ). Let T = {£1, £2, • • • , t P } be 
the set of roots of a and define a function u : T — + { 1 , 2 } by 

u>(£.) = 


1 if ti € { tj , £j+i , 6} 
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w(ij) — 2 if f,' ^ 1,6}- 


Under the above assumptions, it follows that 


^ n - 

»=i 

By Theorem 6.5, there exists a u-polynomial u such that u(t) ^ 0 for t € (a,b) — T 
and such that t, is a nodal zero if u(ti) = 1 and a nonnodal zero if u ?(<,) = 2. Since 
u;(tj) = 1 = u(tj+ 1 ), tj and t J+ x are nodal zeros of u. So u must change signs at tj if 
tj ^ a and at tj+\ if tj+ 1 / b. Then (after multiplying by —1 if necessary) u vanishes 
on (a, b ) precisely at the roots of cr and has the following properties: 


u(t) 


>0 if 
<0 if 


Now u(t) = E”=o a t«.(0 for constants a 0 ,ai, - • • ,a n and by assumption / u,dcr* = 
f Uida for each i = 1, 2, • • • , n. Since u(t) = 0 at the roots of <7, then / u(t)dcr = 0. 
Also, <j * has no points of increase in 1 ), so 

0 = J u(t)(dcr* — da) — J u(t)dcr* 

= f u(t)dcr * + [ u(t)d<x* 

Since u(t) > 0 for t ^ [t } , tj+i], if cr * increases at a point which is not a root of <7, 
then /[ a t ] u[t)da * +cr[ t]Jrl ^u{t)dcr* > 0. This contradiction implies that the roots of 
<7* are also roots of a. If a has index then as indicated above, n must be greater 
than 1, and the number of roots of a is less than or equal to ^2 + l. If a has index 
the number of roots is less than or equal to + 1. In either case, the number 
of roots of <7 is less than or equal to n + 1. Since {u 0 , Ui, • • • , u„} is a T-system, a 
and <7* cannot be distinct. Therefore, if a and cr * are different representations, then 
cr * must have a point of increase in the interval □ 


Theorem 6.22 For each c in the interior of M. n +\ , there exist precisely two principal 
representations. The roots of these representations strictly interlace. 


The proof of this theorem is given in [28 . 
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Example: Let tt 0 ,ui,u 2 , «md u 3 be the functions 

tio(«) = l «i (t) = t u 2 {t) = t 2 u 3 {t) = t 3 

These functions form a T-System on [0, 1], and M 4 is the smallest convex cone con- 
taining the set 

C 4 = {(u o (0,«i(*)> u 2(*)> u 3(0) I 1 e l 0 ’ 1 )} 

Let c be the point 

c = (co,Ci,C 2 ,c 3 ) = 1 dt, J* t dt, £ t 2 dt, £ t 3 dt) = (1, 1/2, 1/3, 1/4). 

By definition, c € M 4 . Also, c is not a boundary point of M 4 , so by Theorem 
6.22, there exist precisely two principal representations of c, and the roots of the two 
representations strictly interlace. By Theorem 6.20, the two sets of roots must be 
{si,s 2 ,s 3 } and {fi,t 2 }, where 

si = 0, 0 < s 2 < 1, s 3 = 1 and 

0 < tx < t 2 < 1. 

In the first case, the representation of c = (co,c 1 ,c 2 ,c 3 ) is given by 

c, = Qiu,(si) + a 2 u,(s 2 ) + a 3 u,(s 3 ) 


where 

Si =0, s 2 = 1/2, s 3 = 1 


Qi = 1/6, o 2 = 2/3, a 3 = 1/6. 


The second representation is found by Gaussian quadrature to be 


C, = + PlU t (t 2 ) 


where 

(, = a .21 1 I, = 2^3 s- .789 

0i = 1/2 0i = 1/2 

The index of each representation is 2, and the roots do strictly interlace. 
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Example. Consider the same collection of functions discussed in the previous 
example with the addition of the function u 4 (t) = t 4 . Then 

Cs = {(uo(t),Ul(t),U 2 (0,«3(0> U 4(0) M € [0, !]} 

Let c be the point 

c = (co, c 1; c 2 , c 3 , c 4 ) = 1 dt, £ t dt, £ t 2 dt,£ t 3 dt, jf t 4 dt) = (1, 1/2, 1/3, 1/4, 1/5). 

Then c G M.$ and c is not a boundary point. By Theorem 6.20, the roots of the two 
principal representations are {<i,J 2 ,< 3 } and where 

0 = ti < t 2 < h < 1 and 

0 < si < s 2 < s 3 = 1. 

In each case the index of the representation is 2x. The roots of the representation 
obtained by Gaussian quadrature are 

1 _ vT5 1 1 vT5 

2 ~ 10 2 2 + 10 ' 

Although all representations involve the same number of points, the index of the 
Gaussian quadrature representation is 3, and therefore it is a canonical representation 
but not principal. 

The following two theorems are stated without proof. The proofs can be found in 
[28]. 

Theorem 6.23 Let c belong to the interior of A4 n+1 . Consider two different repre- 
sentations of c of index less than or equal to ( n + l)/2 as follows: 

Ci = V u «(*/) = Yi V u <(*/)> * = 0, 1, • • • , n. 

j=i j = l 

Then p and q belong to the interval + 1 , where [r] denotes the greatest 

integer < r. The roots {t/)i and {t/'}’ strictly interlace in (a, 6) but may possibly 
share one or both of the end points a or b. 

Theorem 6.24 Let c be in the interior of M n+ 1. For any t~ satisfying a < t" < b, 
there exists a unique canonical representation of c involving t" . 



CHAPTER VII 

EXPONENTIAL INTERPOLATION 


In the method of Gaussian quadrature, the integral of a polynomial p in CP„_i over 
a closed interval [a, 6] is represented by the sum 

f*p(t)dt = U= ihp{tk) 

where the nodes, t x , f 2 , * • • , t n , belong to [a, 6], and the weights, A x , A 2 , • • • , A n are real 
numbers. Such a representation is exact for all polynomials in IPn-i, and if the nodes 
are chosen appropriately, it will be exact for all polynomials in 3P 2n -i- In this chapter, 
Dirichlet polynomials will be considered. A Dirichlet polynomial is a function / which 
has the representation 

f(t) = aoe Ao< + aje Alf + • • • + a n e Anf (<•!) 


where ao, a 2 , • • • , a„ are real constants and A 0 , Aj, • • • , A n are complex constants. 

Suppose / : [a, 6] — ► IR is a function defined by 7.1 and assume A 0 , A 2 , • • • , A n are 
given real constants such that Ao < Ai < ••• < A n . Assume also that to,ti, • • ■ ,t p 
are given points in [a, 6] with to < ti < ■ • ■ < t p , at which the values of / are 
known, and let /(<*) = w* for k — 0, 1, • • • ,p. Theoretically, if p is large enough, the 
value of the integral /*/(£) dt can be determined without knowledge of the coefficients 

<* 0 , ^ 1 5 * ’ * 5 

Suppose the value of / is known at n + 1 distinct points t 0 , t 2 , •••>*»»• The set of 
functions {exp Afct }£ was shown in Chapter VI to be a T-system. So the determinant 


exp(A 0 t 0 ) exp(A 0 ti) ••• exp(A 0 t n ) 
exp(A 1 < 0 ) exp(A 1 t 1 ) ••• exp^t,,) 


I exp(A n t 0 ) exp(A n <i) ••• exp(A n t n ) | 

is positive whenever a < to < tj < ••• < t n < b. Therefore, the coefficients 
a 0 , ai , • • • , a n are given by 


(a 0 \ 


[ W °\ 

ax 

= U~ x 

Wi 

a " ) 


V / 
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The integral can be calculated using the computed values of a 0 , a x , • ■ • , a n . But if the 
matrix U is large or poorly conditioned, or if / is not known at n + 1 points, then 
this method of computing the integral is either inefficient or impossible. 

Define a vector c = (co, Ci, • • • , Cn) in E n+1 by 

c, = / 0 6 e Alt dt. 

Then by Definition 6.8, c € M n + 1- By Theorem 6.16 and Theorem 6 . 23 , there exists 
a representation of c of the form 

p 

d = ^ ] 7j^ 3 , i — 0, 1, • • • , ti 

j=i 

where 71,72, • * * ,7 P are real constants, a < t x < t 2 < ■ ■ - < t p < b, and p < 

(The notation [r] denotes the greatest integer less than or equal to r.) Then 

timdt = a 0 jy^dt + ai jy it dt + --- + a n fy nt dt 

= aoY, iy otl + °1 Z 7 y t] 4 ■ ■ ■ + a n £ iy nh 

J=1 J=1 J=1 

= 71 E keXktl + 72 E keXkt 2 + • • • + 7p £ keXktp 

k - 0 Jt=0 k=Q 

= 7i/(*i) + 72 /U 2 ) + • • • + 7p/(^p)- 

For example, let 

f (t) = cto e * 4 " a iC 2t 4 * <*2 e 4 a 3 e 



By making the substitution 

x = e~ l and 

g(x) = a 0 4 a^x 4 a 2 x 2 4 a 3 x 3 , 

the integral of / over the interval [0, 00) becomes 

[ f{t)dt = [ g(x)dx. 

Jo Jo 

As mentioned in the previous chapter, one of the principal representations of the 
point ^ 1 

c = (cq, ci,c 2 ,c 3 ) = (J 1 dx, J xdx,J x 2 dx,J x 3 dx) 



54 


is given by 

d = 0\Ui(Xi) + /?2“i(*2), 

where the roots x 2 and x 2 are those found using Gaussian quadrature. When using 
this method, the orthonormal functions 

*o = l 

*! = \/3(2x — 1) 

* 2 = v^x 2 - 6x + 1). 

are computed, and X\ and x 2 are the following zeros of the polynomial *2 : 

3 — y/3 


Xl = 


x 2 = 


6 

3 + v/3 

6 


« .211 


.789. 


Using the Lagrange method of calculating the coefficients, we have 

_ f l x — ; 

fix — I 

J 0 Xi — 


Xl — X2 

x x 


x 2 1 

dx = - 


r It — r, I 


dx = -. 
x 2 — X\ 2 


. t x x — : 

02 = 

Jo x 2 — 

The resulting representation is 

J g(x)dx = ao J Idx + aiJ x dx + a 2 J x 2 dx + 03 jf x 3 dx 

= fiig(xi) + (hg{x 2 )■ 

Since x = e - *, then t = — ln(x) and g{e~ t ) — e* f(t). So 

roc 1 1 

l f(t)dt = je''/(<.)+2 £ "/(<2) = 7./(f.) + 72 M) 

where the roots ij and t 2 and the weights 71 and 72 are given by 

'3->/3\ 


= —In 


t 2 = - In 


, 6 , 
'3 + n/3' 
6 


7i 


= I, = l (3 -JS- 

2 2 V 6 


-1 


1 ta 1 (Z + Vl' 

72 = 2 e = 2 “T~ 
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The above procedure is applicable whenever the constants A 0 , Aj, • • ■ , A n are con- 
secutive negative integers, and it suggests the possibility of applying quadrature to 
Dirichlet functions which do not have consecutive negative integer constants. For 
example, let 

f(t) = a 0 + dje - * + a 2 e~ 4t . 

Define functions it 0 ,tii, and u 2 by 


u 0 (t) = 1 

Ui(t) = e~‘ 

u 2 (t) = e~ 4t 


A representation of the point 

c=(^J uo dt, dt, jf u 2 (t)e~ l dt'j 

can be found which involves only two roots. As in Gaussian quadrature, the set 
{uo,U!,u 2 } is used to construct the orthonormal set where 

*o(0 = 1 

*i(<) = N/3(2e-‘-l) 

*.(«) = |(<e-«-4e-‘ + l). 

The zeros of $ 2 (t) are 

= .1951176928 and t 2 = 1.365272252. 


Choosing 


7 ! = .43122554915 and 7 2 = .568774508 
gives a representation of c of the form 


Ci = 7iUi(*i) + 72«.(<2) 

for i = 0,1,2. The integral of the function / is given by 

f f{t)e- l dt = 7 i/(ix) +72/(*2)- 
Jo 
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Other representations can be found by first making the substitutions x — e * and 
g(x ) = ao + Q\X + a 2 x 4 to get 

/ f(t)e~ t dt= f g{x)dx. 

Jo Jo 

Then Gaussian quadrature is applicable. This requires the computation of an or- 
thonormal set and the zeros of tf 3 are the roots required in the 

representation. The function $3 is determined to be 

tf 3 (x) = v / 7(20x 3 - 30x 2 + 12x - 1) 


and the zeros are 


Xi 


5-^15 
10 ’ 


x 2 — x 3 ~ 


5 + yis 

10 


The weights are determined to be 

= 5/18, S 2 = 4/9, S 3 = 5/18. 


Then 


where 



/ g(x)dx 
Jo 

Si g(x x ) + S 2 g(x 2 ) + S 3 g(x 3 ) 
Sif(ti) + S 2 f(t 2 ) + S 3 f(t 3 ) 


ti = — lnxx 

t 2 = — In x 2 

t 3 = - In x 3 


2.183011081 

.6931471806 

.1195740121. 


In the above representations, the first required knowledge of the function / only at 
two points, whereas the second representation, which was found by use of Gaussian 
quadrature, required three points. The index of the first is 2, and the index of the 
second is 3. There are two other representations for the integral 

/ g{z)dx 

Jo 

which each have index li. Each of these is a principal representation. One involves 
the nodes Z\ and z 2 , where z x = 0 and 0 < z 2 < 1. The other involves nodes rj and 
r 2 , where 0 < r x < 1 and r 2 = 1. These yield the following representations. 
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f°° Me-' dt 

J 0 

= 7i/(2i) + 72 /( 22 ) 

fl = 00 

t 2 = .3054302439 

7 x = .3213955956 

72 = .6786044044 


*♦0 

7 

•v» 

8 

— ^i/( 5 i) + ^ 2 /(^ 2 ) 

s x = .9432378690 

O 

II 

6 X = .8188198595 

62 = .1811801405 


In general, it would be desirable to find an efficient way of representing Dirichlet 
polynomials which have exponents with arbitrary negative coefficients. For 
example consider the function / given by 

/(f) = e~ 2t + e -t + e~ ^ + e“K 

The functions e _2t , e -< , e _ J* and e - 3 J form a T-System. So there should exist 
constants 71 , 72,73 and nodes ^ i , < 2 , ^3 such that 

roc 

/ /(*) • e~*dt = 71 f(ti) + 72 /(t 2 ) + 73/(23)- 

Jo 



CHAPTER VIII 


QUADRATURE METHODS 

Let A be an n x n constant matrix, c a constant vector in IR n , and I an interval. 
Let x be a function from / to 1R". Consider the linear system 

<*i ^ 

“ 2 („ € I ( 8 . 1 ) 

<■» / 

It can be assumed that t 0 = 0, since an appropriate translation of the variable trans- 
forms any system into this form. Assume that I has the form [0,oo) or [0,6] for a 
real number 6 > 0. Assume also that the system is discretely observable with respect 
to the set of points {ti,t 2 , ' • * , <m} C I- 

The solution for x in (1) is given by 

x(t) = exp(Ai)x(0) 

The n x n matrix exp(At) has the form exp(At) = ( qij{t )), where each q t] {t) is given 

b y 

= Y, Pijk(i)e Xkt 

Ar=l 

where {A l5 A 2 , • • • , A*} is the set of distinct eigenvalues of A , and each p l} k is a poly- 
nomial of degree less than n. 

Then 

n 

y(t) = c T x(t) = c T exp(At)x(0) = hjliji*) 

ij=i 

where each 6^ is a constant. Regrouping the terms gives 

y(t)=±r,(t)^\ 

1=1 

where each r t is a polynomial of degree less than n. 
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The solution y can be determined directly from knowledge of the vectors c and x(0). 
But it can also be determined from known values of y at points <1, <2, • • * , t m . 

Suppose there exists an invertible constant matrix Q such that 

y(t) = c T exp(At)QQ~ 1 x(0) 

and 

c T exp(At)Q = ( <fo(t) fa(t) ••• <f> n (t) ) 

where <fa (<), • • • , <f>n(t)} is an orthonormal collection of functions on the interval 

/ with respect to the inner product 

(/.«) = ImmMt)- (8.2) 

Then, letting 

Q~ l x(Q) = 

y has the representation 

y = Ui<j>i(t) + w 1 ^ 2 (t) + • • • + u} n 4> n {t) 

on I. Then for each k, the constant u k is determined without knowledge of Q or 
x(0) if {y, 4>k) is known for each k. These values can be determined from the known 
values y(t 1), ^(^2), • • * ,y(tm) if a quadrature formula involving points <1, t 2 , • • • ,t m can 
be found which is exact for all (q tJ , <f> k ), where i,j,k = 1,2, • • • , n. 

If such a quadrature formula can be found, then for each k — 1,2, • • • , m, 

wjt = {y,4>k) (8.3) 

= J y{t)<t>k(t)dti(t) 

= Aiy(ti)<t>k{ti) + A 2 y{t 2 )(pk(h) + • • • + A n y(t m )<f)k{tm ), 

The eigenvalues of the matrix A will determine the form of the solution y , and it 
will be assumed that A is in Jordan canonical form. This assumption leads to no loss 
of generality since every matrix is similar to a matrix J in Jordan canonical form, 
and the linearity of the dynamical system is preserved under a change of variables of 
the form Z = Px, where A = P~ l JP. 


UJi 

U^2 
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8.1 Nilpotent Matrices 

If A is a nilpotent matrix, then the solution is a polynomial and can be written 
as a linear combination of orthogonal polynomials. In special cases, these orthogonal 
polynomials have been computed. In general, they can be obtained using a three term 
recursion formula. Gaussian quadrature is applicable, and formulas are available for 
some particular intervals of integration and weight functions. See [8]. 


8.2 Matrices With One Real Eigenvalue 
Let A be the following n x n matrix in Jordan form: 

(X 1 ' 

A 1 


A = 


A 1 
A 


Then the solution y of the linear system (1) is given by 

y(t) = c T x(t ) = c T exp(>U)x(0), 


where exp(At) is the matrix 


exp(/lt) = exp(A t)M(t), 


with 


(l t t 2 / 2 t 3 / 3! ••• r-V(n-l)!^ 
1 t t 2 j 2 ••• t n ~ 2 /{n — 2)! 


M(t) = 


1 t 
1 


1 2 /2 

t 

1 


Suppose there exists a set {<J>jt}S _1 of polynomials which are orthonormal with respect 
to the inner product 

(f,g) = 
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where 

4>o(t) = Sn 

<f>l(t) = «21 + 5 22^ 

< f >2 (<) = 531 + 5 32 * + 5 33* 2 

</)„_! (t) = S n l + Srft 4 + S nn t 1 

and for each k = 1, 2, • • • , n, s kk ± 0. See [19] for existence of such a set and recursion 
formulas for computation of the polynomials. 

Let J = ( Ci C2 ... Cn ). Since the system is observable, c k ± 0 for each 

k = 1, 2, • • • , n. Therefore the matrix 

' C\ C2 C3 C\ ■ • • Cn ^ 

Cl C 2 C3 • • • Cn - 1 

Cl C2 . . Cr >— 2 

2 2 ' 2 


is invertible. Let 


■Sn S 2 l 531 S41 

S 22 ^32 ^42 

S33 543 


(n-1)’. 


Then there exists a matrix 


9n 912 9i3 9 14 
922 923 924 

933 934 
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satisfying CQ = S. Since each Skk ^ 0, Q is invertible, and 


c T exp(At)Q = c T exp(Xt)M(t)Q 

= exp(At) ( 1 t t 2 ••• t n ~ l ) CQ 

= exp(At) ( 1 t t 2 • ■ • t n ~ l ) S 

= exp(Af) ( <f> 0 (t) fa (t) ••• 4>n-i{t). ) 


Let 


Q-'x{ 0 ) 


Then the representation for y is 


c° \ 
w n _! / 


y(i) = c T exp(,4*);r(0) = c T exp(At)QQ x x(0) 

= exp(Af)(u;o^o(0 + <^ifa{t) + h Un-ifa-i{t))- 


Let / be defined by f(t ) = exp(— A t)y(t). Since the set {<^>t}o 1 is orthonormal, 
each coefficient u>k is given by 

Wfc = {f, fa). 

Since / and <fik are polynomials of degree less than or equal to n — 1, then f<f>k is 
the product of exp(Ai) and a polynomial of degree less than or equal to 2n — 2. 
The functions exp(Atf), texp(Xt), t 2 exp(A£ ),•••, tf 2n_2 exp(Af) form a T-System. So 
there exists a quadrature formula involving n points ti,t 2 ," • ,t n in I and weights 
a 1? a 2 , If the values y(*x),y(t 2 ), • • • , y{t n ) are known, u>jt is given by 

u k = a 1 exp(-A< 1 )y(t 1 )^ fc (<|) + 

a 2 exp(-A t 2 )y{t 2 )fa{h) + h 

a n exp(-\t n )y(t n )<t>k{t n ). 
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8.3 Distinct Real Eigenvalues 

Let A be an n x n matrix with distinct real eigenvalues, and consider the linear 
system (8.1). Assume t 0 = 0, c has no zero components, and 


A = 






where \\ < A 2 < • • • < A n . The solution for y is given by 

y(t) = c T diag{exp (A x <),exp (A 2 f), ■ • • , exp (A n t)}x(0), 


which has the form 

y(t) = &i exp (Ajt) + & 2 exp (A 2 t) H h b n exp (A n <) 

for real constants 6^, 6 2 , • • • , 6 n . 

Using the Gram-Schmidt orthonormalizing process, an orthonormal set of functions 
{^ 1 ,^ 2 , • • • , 4> n ) with respect to the inner product (8.2) can be found satisfying 

<j>i(t) = Sne A,t 

<ji> 2 (t) = S2ie A ' J + s 22 e Ajt 


<M0 = s nl e Ali + 5 n2 e Ajt H hs nn e A "‘ 

where Skk ^ 0 for each k = 1, • • * ,n. Let 

c T = (c 1 ,c 2 ,---,c n ), 


5 = 


/ Sjj S 21 S 31 • • • S n j 

S 2 2 $32 ' ’ ‘ 5 n2 

■S33 ‘ ' ‘ -Sna 


\ 


V 


^nn / 


and Q = diag{l/c,, l/c 2 , • • • , l/c n } 5. 
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Then 

y(t) = c T diag{exp(A x f),exp(A 2 i),- * * ,exp(A n <)} *(0) 

= ( exp(A x £) exp(A 2 £) ••• exp(A n £) ) diag{ci, c 2 , • • * , c„} x(0) 

= ( exp(A x f) exp(A 2 £) ••• exp( A n f) ) diag{ci, c 2 , •••, c*} QQ 1 x(0). 

Note that 


( exp(A x £) exp(A 2 f) ••• exp(A n £) ) diag{c x , c 2 , • • • , c„} Q — 
( exp(A 1 <) exp(A 2 £) ••• exp(A n £) ) S = 

( Mi) Mi) ' • ' <M0 ) • 

Denote the constant matrix Q~ x x(0) by ( u/ x u> 2 • • • uj n ) . Then 


y(0 = ( Mi) Mi) 4*n{i ) ) 


/ui, \ 


= + UJ 2 <f> 2 (t) + * ‘ • + W n (t). 


(8.4) 


Since {<f>\, <f> 2 , • • • , 4>n) is an orthonormal set, the coefficients u>i,u> 2 , • • • ,u in (8.4) are 
given by 

— j y{t) <Pk{t) dn{t) k — 1, 2, • • • , n. 

Since y{t) and <pk{t) are linear combinations of the functions e Xlt , e Az ‘, • • • , e An< , then 
y{t)(j)k{t ) is a linear combination of functions of the form e* A,+A >) < . Let B = {/3 X , /? 2 , • • • , &,} 
be the set of distinct elements of {A, + A ; | i,j = 1,2, •••,«}, where 2n — 1 < s < 

. Then for each k = 1,2, there exist constants Aki, Ak 2 , • • • , Ak 3 such 

that 

y{i)Mi) — AkM' 1 + Ak 2 e 07t + • • • + AkM at • 

Then 

u>k — J^(Aki^ lt + Ak 2 e^ 1 + • • • + Ak 3 e^ at ) 

Suppose I is the closed finite interval [a, 6]. The functions exp(/3i£), exp(/? 2 i ) , ■ • • , 
exp(/3 a t ) form a Tchebycheff system (T-system) over [a, b]. The point c = (ci, c 2 , • • • , c s ), 
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where 

Ci= f exp (Pit)dfi(t), 

J a 

belongs to the moment space M.,. So by Theorems 6.16 and 6.19, there exists a 
quadrature formula involving a set of r points • • ,t r , where r < If any 

of the points are preassigned, a quadrature formula still exists, but it might require 
more than ^ points. The maximum number of points required would be s. 

It is shown in [7] that if a quadrature rule of the form 

/ exp (0it)dfi(t) = £ Qj exp (/?,*_, ) i = 1, 2, • • • , s 

Ja j=i 

exists, then s < 2 n. Since 2n — 1 < s, we have s = 2n — 1 or s = 2n. If, however, 
s > 2 n, the values of y(t) at more than n points would be required. 


8.4 Real Repeated Eigenvalues 

Let A be an n x n matrix with real eigenvalues, where some of the eigenvalues 
are repeated. Let Ai,A 2 ,*--,A, be the distinct eigenvalues. Then the solution y of 
(8.1) has the form 

5 

y(t) = 53 r i(0«p(A;0 

«=i 

where each r, is a polynomial of degree less than n. This can be written as a linear 
combination of the linearly independent functions: 

exp^t), texp(Ajf), t 2 exp(A t t), ••• , f^exp^t), 

exp(A 2 £)) texp(A 2 t), t 2 exp(A 2 f), , t m2 exp(A 2 <), 

exp(A,t), texp(A,t), t 2 exp(A s t), , t m, exp(A,t) 

where m, + 1 is the multiplicity of the eigenvalue A,. 

Theoretically, this collection of functions can be orthonormalized, and the previous 
method could be used to determine a solution of (8.1) by evaluating constants of the 
form 


(8.5) 


u>k = J y(t)<pk(t)dfx(t) 
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Each function y(t)(j>k(t) is a linear combination of the linearly independent func- 
tions: 

exp(/?if), texp(^it), t 2 exp(/3it), , t ni exp(flit), 

exp{fot), texp({3 2 t), t 2 exp(fi 2 t), ••• , exp(/3 2 t)i 

exp (0 r t), texp(firt), t 2 exp((3 r t ), ••• , t nr exp{j3 r t) 

where B = {/?i,/? 2 , * * * ,&} is the set of distinct elements of the set {A, + X : \ i,j = 
1,2, ••*,«}. If each of the above functions is integrable over 7, then they form a 
T-system. In that case, quadrature formulas exist which require or less points, 
where s = r + £,- =1 n r - 


8.5 Complex Eigenvalues 

Suppose the n x n matrix A has complex eigenvalues, some of which are repeated. 
If Ai, A 2 , • • • , A a are the distinct eigenvalues, then the solution y of (8.1) can be written 
as a linear combination of the functions given in (8.6). If for each k, A fc = Uk + ivk 
where u* and v k are real, the solution y can be written as a linear combination of the 
functions: 

exp(uxt) cos |ui|t, texp(uit) cos |vi j<, ••• t mi exp(uif) cos |ui|£, 

exp(uit) sin |fi|f, texp(iqi) sin | | < , ••• t mi exp(uit) sin |iq|t, 

i ! : (8-7) 

exp(u,t) cos t exp(u 3 t) cos ••• t m ’ exp(u 3 t) cos \v 3 \t, 

exp(u,<) sin t exp(u s t) sin ••• t" 1 ’ exp(u s i) sin |u,|t. 

Note that if the Jordan matrix A is similar to a real matrix, then the eigenvalues 
occur in conjugate pairs, and the functions in (8.7) are not all distinct. In order to 
apply the method used for real eigenvalues to matrices with complex eigenvalues, a 
maximal linearly independent set would have to be selected from the functions in 
(8.7), and this set would have to be transformed into a set of orthonormal functions 
{< i>k }. Then, as before, 

y(t) = uJi<t>i(t) + • • • + u 


where 


Wfc = y(t)<j>k{t)dn(t). 
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The function y(t)<f>k(t) would be in the span of a set T, where T is a linearly 
independent subset of a set 

S = \J^-i{exp(akt)cos fat, exp(ctkt)sm fat, 

t exp (ctkt) cos fat, t exp(ajfcf) sin fat, 

t mk exp(a*t) cos fat, t mk exp(ajtt) sin fat). 

Quadrature formulas for such integrals are known for special cases. See, for example, 
[15] and [30]. The existence of a quadrature formula is guaranteed in case S is a T- 
system with respect to the interval of integration. For example, the system of 2m + 1 
functions 

1 , cos t, sin t , • • • , cos mt , sin mt 

is a T-system on any interval [a, 6] of length less than 2ir. Furthermore, if {u^} is a 
T-system and the function r is positive and continuous, then {ru,} is a T-system. 
See [28]. 

Other examples of T-systems are eigenfunctions of Sturm-Liouville operators. Let 
the operator L be defined by 

LW ~ { p t ) + ? * 

where p is continuous and positive on [a, 6]. Let I\(x,y) be the Green's function 
associated with the eigenvalue problem 

L(4>) = A 4> 

with boundary conditions 

4>{c) sin ot — p(a)^ , (a) cos a = 0 
fab) sin/? + p(b)<j)'(b) cos f3 = 0. 

If K(x,y) satisfies certain conditions, then the set of eigenfunctions 4>o, fa ,••• ,<j> n is 
a T-system on any closed subinterval of (a, b). See [10]. 
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8.6 Remarks 

In summary, the determination of a solution y to (1) can be found by a method 
which depends on orthonormalizing a given set of functions which form a T-system 
and finding the nodes and weights in a quadrature formula. In some cases, this 
method has been completely worked out. In the remaining cases, the theory of T- 
systems guarantees the existence of quadrature formulas, but the number of data 
points required might be greater than n. Further study of these remaining cases to 
determine the least number of data points required and efficient methods of deter- 
mining weights and nodes would be helpful. 
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